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Multi-Antigen Vectors for Melanoma 

FIFTTfc OF TH E INVENTION 

The present invention relates to multi-antigen vectors for use in preventing and / or 
5 treating cancer. In particular, the invention relates to multi-antigen vectors for use in treating 
and/or preventing melanoma. 

BACKGROUND OF THE INVENTION 

There has been tremendous increase in last few years in the development of cancer 

10 vaccines with tumour-associated antigens (TAAs) due to the great advances in identification of 
molecules based on the expression profiling on primary tumours and normal cells with the help 
of several techniques such as high density microarray, SEREX, immunohistochemistxy (IHC), 
RT-PCR, in-situ hybridization (ISH) and laser capture microscopy (Rosenberg, Immunity, 1999; 
Sgroi et al, 1999, Schena et al* 1995, Offringa et al, 2000). The TAAs are antigens expressed or 

1 5 over-expressed by tumour cells and could be specific to one or several tumours for example CEA 
antigen is expressed in colorectal, breast and lung cancers. Sgroi et al (1999) identified several 
genes differentially expressed in invasive and metastatic carcinoma cells with combined use of 
laser capture microdissection and cDNA microarrays. Several delivery systems like DNA or 
viruses could be used for therapeutic vaccination against human cancers (Bonnet et al, 2000) and 

20 can elicit immune responses and also break immune tolerance against TAAs. Tumour cells can 
be rendered more immunogenic by inserting transgenes encoding T cell co-stimulatory 
molecules such as B7.1 or cytokines such as IFN-y, IL2, or GM-CSF, among , others. Co- 
expression of a TAA and a cytokine or a co-stimulatory molecule can develop effective 
therapeutic vaccine (Hodge et al, 95, Bronte et al, 1995, Chamberlain et al, 1996). 

25 There is a need in the art for reagents and methodologies useful in stimulating an immune 

response to prevent or treat cancers. The present invention provides such reagents and 
methodologies that overcome many of the difficulties encountered by others in attempting to 
treat cancer. . 
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SUMMARY OF THE INVENTION 

The present invention provides multi-antigen vectors for adrninistration to a patient to 
prevent and /or treat cancer. In particular, the multi-antigen vector encodes one or more tumor 
antigens ('TA")- The multi-antigen vector may also encode an immune stimulator such as a co- 
stimulatory molecule and/or be administered with an adjuvant. 

BRIEF DESCRIPTION OF T TTE DRAWINGS 
Figure 1. Schematic of plasmids pALVAC.Tricpm(#33) and pT1132. 
Figure 2. DNA sequence of plasmid pALVAC.Tricom(#33). 
Figure 3. DNA sequence of plasmid pT1132. 
Figure 4. Schematic of plasmid pT3217. 
FigureS. DNA sequence of plasmid pT3217. 

Figure 6. Amino acid sequences of exemplary NY-ESO-1, TRP-2, gplOO, gplOOM, MART- 
1, MAGE-1, MAGE-3, B7.1, LFA-3, and ICAM-1 proteins. 

DETAILED DESCRIPTION 

The present invention provides reagents and methodologies useful for treating and / or 
preventing cancer. All references cited within this application are incorporated by reference. 

In one embodiment, the present invention relates to the induction or enhancement of an 
immune response against one or more tumor antigens ('TA") to prevent and / or treat cancer. In 
certain embodiments, one or more TAs may be combined. In preferred embodiments, the 
immune response results from expression of a TA in a host cell following adnunistration of a 
nucleic acid vector encoding the tumor antigen or the tumor antigen itself in the form of a 
peptide or polypeptide, for example. 

As used herein, an "antigen" is a molecule (such as a polypeptide) of a portion thereof 
that produces an immune response in a host to whom the antigen has been administered. The 
immune response may include the production of antibodies that bind to at least one epitope of the 
antigen and / or the generation of a cellular immune response against cells expressing an epitope 
of the antigen. The response may be an enhancement of a current immune response by, for 
example, causing increased antibody production, production of antibodies with increased affinity 
for the antigen, or an increase in the cellular immune response (i.e., increased number or activity 
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of immunoreactive T cells). An antigen that produces an immune response may alternatively be 
referred to as being immunogenic or as an immunogen. In describing the present invention, a 
TA may be referred to as an "immunogenic target". The present invention provide expression 
vectors for expressing in a host one or more immunogenic targets. 
5 The term TA includes both tumor-associated antigens (TAAs) and tumor-specific 
antigens (TSAs), where a cancerous cell is the source of the antigen. A TAA is. an antigen.that is 
expressed on the surface of a tumor cell in higher amounts than is observed on normal cells or an 
antigen that is expressed on normal cells during fetal development. A TSA is an antigen mat is 
unique to tumor cells and is not expressed on normal cells. TA further includes. TAAs or TSAs, 
10 antigenic fragments thereof, and modified versioris tliat retain their antigenicity. 

TAs are typically classified into five categories according to their expression pattern, 
function, or genetic origin: cancer-testis (CT) antigens (i.e., MAGE, NY-ESO-1); melanocyte 
differentiation antigens (i.e., Melan A/MART- 1, tyrosinase, gplOO); mutational antigens (i.e., 
MUM-1, p53, CDK-4); overexpressed'self antigens (i.e., HER-2/neu, p53); and, viral antigens 
15 (i.e., HPV, EBV). For the purposes of practicing the present invention, a suitable TA is any TA 
that induces or enhances an anti-tumor immune response in a host to whom the TA has been 
adnnnistered. Suitable TAs include, for example, species of gplOO (Cox et al., Science, 264:716- 
719 (1994); U.S. Pat. No. 6,500,919 Bl and WO 01/30847 with Val at residue 162, also referred 
to as "gplOOM"; U.S. Pat. No. 6,537,560 Bl with Phe at residue 162), MART-1 /Melan A 
(Kawakami et al., J. Exp. Med., 180:347-352 (1994); U.S. Pat. No. 5,874,560), gp75 (TRP-1) 
(Wang et al., J. Exp. Med., 186:1131-1140 (1996)), TRP-2 (Wang et al. 1996 J. Exp. Med. 
184:2207; U.S. Pat Nos. 5,831,016 and 6,083,783), tyrosinase (Wolfel et al:, Eur. J. Immunol, 
24:759-764 (1994); WO 200175117; WO 200175016; WO 200175007), NY-ESO-1 (WO 
98/14464; WO 99/18206; GenBank Accession No. P78358; U.S. Pat. No. 5,804,381), melanoma 
25 proteoglycan (Hellstrom et al., J. Immunol, 130:1467-1472 (1983)), MAGE family antigens 
. (i.e., MAGE-1, 2,3,4,6,12, 51; Van der Bruggenet al., Science, 254:1643-1647 (1991); U.S. Pat. 
Nos. 6,235,525; CN 131961 1), BAGE family antigens (Boel et al., Immunity, 2:167-175 (1995)), 
GAGE family antigens (i.e., GAGE-1,2; Van den Eynde et al., J.. Exp. Med., 182:689-698 
(1995); U.S. Pat. No. 6,013,765), RAGE family antigens (i.e., RAGE-1; Gaugler et at, 
30 Immunogenetics, 44:323-330 (1996); U.S. Pat. No. 5,939,526), N-acetylgliicosaminyltransferase- 
V (Guilloux et at, J. Exp. Med., 183:1173-1183 (1996)), pl5 .(Robbins et al., J. Immunol 
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154:5944-5950 (1995)), B-catenin (Robbins et al, J. Exp. Med., 183:1185-1192 (1996.)), MUM-1 
(Coulie et.al., Proc. Natl Acad. Sci. USA, 92:7976-7980 (1995)), cyclin dependent kinase^ 
(CDK4) (Wolfel et al., Science, 269:1281-1284 (1995)), p21-ras (Fossum et at., Int. J. Cancer, 
56:40-45 (1994)), BCR-oW (Bocchia et al., Blood, 85:2680-2684 (1995)), P 53 (Hieobald et al., 
Proc. Natl. Acad. Sci. USA, 92:11993-11997 (1995)), pl85 HER2/neu (erb-Bl; Fisk et al., J. 
Exp. Med., 181:2109-2117 (1995)), epidermal growth factor receptor (EGFR) (Harris , et al., 
Breast- Cancer Res. Treat, 29:1-2 (1994)), carcinoembryonic antigens (CEA) (Kwong et al., J. 
Natl. Cancer InsU, 85:982-990 (1995) U.S. Pat. Nos. 5,756,103; 5,274,087; 5,571,710; 
6,071,716; 5,698,530; 6,045,802; EP 263933; BP 346710; and, EP 784483); carcinoma- 
associated mutated mucins (i.e., MUC-1 gene products; Jerome et al., J. Immunol, 151:1654- 
1662 (1993)); EBNA gene products of EBV (i.e., EBNA-1; Rickinson et al., Cancer Surveys, 
13:53-80 (1992)); E7, E6 proteins of human papillomavirus (Ressing et al., J. Immunol, 
154:5934-5943 (1995)); prostate specific antigen {PSA; Xue et al., The Prostate, 30:73-78 
(1997)); prostate specific membrane antigen (PSMA; Israeli, et al., Cancer Res., 54:1807-1811 
(1994)); idiotypic epitopes or antigens, for example, immunoglobulin idiotypes or T cell receptor 
idiotypes (Chen et al.../. Immunol, 153:4775-4787 (1994)); KSA (U.S. Patent No. 5,348,887), 
kinesin 2 (Dietz, et al. Biochem Bibphys Res Commun 2000 Sep 7;275(3):731-8), HTP-55, 
TGFp-1 anti-apoptotic factor (Toomey, et al. Br J Biomed Sci 2001;58(3):177-83), tumor protein 
D52 (Bryne J.A., et al., Genomics, 35:523-532 (1996)), H1FT, NY-BR-1 (WO 01/47959), NY- 
BR-62, NY-BR-75, NY-BR-85, NY-BR-87, NY-BR-96 (Scanlan, M. Serologic and 
Bioinformatic Approaches to the Identification of Human Tumor Antigens, in Cancer Vaccines 
2000, Cancer Research Institute, New York, NY), including "wild-type" (i.e:, normally encoded 
by the genome, naturally-occurring), modified, and mutated versions as well as other fragments 
and derivatives thereof. Any of these TAs may be utilized alone or in combination with one 
another in a co-immunization protocol. 

Preferred TAs are useful for inducing an immune response against melanoma cells. The 
term "melanoma" includes but is not limited to melanomas, metastatic melanomas, melanomas 
derived from either melanocytes or melanocyte related nevus cells, melanocarcinomas, 
melanoepitheliomas, melanosarcomas, melanoma in situ, superficial spreading melanoma, 
nodular melanoma, lentigo maligna melanoma, acral lentiginous melanoma, invasive melanoma 
and familial atypical mole and melanoma (FAM-M) syndrome, for example. In general, 
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melanomas result from chromosomal abnormalities, degenerative growth and development 
disorders, mitogenic agents, ultraviolet radiation (UV), viral infections, inappropriate tissue 
expression of a gene, alterations in expression of a gene or carcinogenic agents, for example. 

In certain cases, it may be beneficial to co-immunize patients with both TA and other 
antigens, such as angiogenesis-associated antigens ("AA"). An AA is an immunogenic molecule 
(i.e., peptide, polypeptide) associated with cells involved in the induction and / or continued 
development of blood vessels. For example, an AA may be expressed on an endothelial cell 
("EC), which is a primary structural component of blood vessels. Where the cancer is cancer, it 
is preferred that that the AA be found within or near blood vessels that supply a tumor. 
Immunization of a patient against an AA preferably results in an anti-AA immune response 
whereby angiogenic processes that occur near or within tumors are prevented and / or inhibited. 
Exemplary AAs include, for example, vascular endothelial growth factor (i.e., VEGF; 
Bernardini, et al. J. Urol, 2001, 166(4): 1275-9; Starnes, et al. J. Thorac. Cardiovasc. Surg., 
2001, 122(3): 518-23; Dias, et al. Blood, 2002, 99: 2179-2184), the VEGF receptor {i.e., VEGF- 
R, flk-l/KDR; Starnes, et aL J. Thorac. Cardiovasc. Surg., 2001, 122(3): 518-23), EPH receptors 
(i.e., EPHA2; Gerety, et al. 1999, Cell, 4: 403-414), epidermal growth factor receptor (i.e., 
EGFR; Ciardeillo, et al. Clin. Cancer Res., 2001, 7(10): 2958-70), basic fibroblast growth factor 
(i.e., bFGF; Davidson, et al. Clin. Exp. Metastasis 2000,18(6): 501-7; Poon, et al. Am J. Surg., 
2001, 182(3):298-304), platelet-derived cell growth factor {i.e., PDGF-B), platelet-derived 
endothelial cell growth factor (PD-ECGF; Hong, et al. J. Mol. Med, 2001, 8(2):141-8), 
transforming growth factors (i.e., TGF-a; Hong, et al. J. Mol. Med., 2001, 8(2):141-8), endoglin 
(Balza, et al. Int. J. Cancer, 2001, 94: 579-585), Id proteins (Benezra, R. Trends Cardiovasc. 
Med., 2001, 1 1(6):237-41), proteases such as uPA, uPAR, and matrix metalloproteinases (MMP- 
2, MMP-9; Djonov, et al. J. Pathol., 2001, 195(2): 147-55), nitric oxide synthase (Am. J. 
Ophthalmol., 2001, 132(4):55l-£), aminopeptidase (Rouslhati, E. Nature Cancer, 2: 84-90, 
2002), thrombospondins (i.e., TSP-1, TSP-2; Alvarez, et al. Gynecol. Oncol., 2001, 82(2):273-8; 
Seki, et al. Int. J. Oncol, 2001, 19(2):305-10), k-ras (Zhang, et al. Cancer Res, 2001, 
61(16):6050-4), Wnt (Zhang, et al. Cancer Res., 2001, 61(16):6050-4), cyclin-dependent kinases 
(CDKs; Drug Resist Updat 2000, 3(2):83-88), microtubules (Timar, et al. 2001. Path. Oncol. 
Res., 7(2): 85-94), heat shock proteins (i.e., HSP90 (Timar, supra)), heparin-binding factors (i.e, 
heparinase; Gohji, et al. Int. J. Cancer, 2001, 95(5):295-301), symhases (i.e, ATP synthase, 
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thymidilate synthase), collagen receptors, integrins (i.e., ctop3, aop5, alpl, a2pl, a5pi), the 
surface proteoglycan NG2, AAC2-1, or AAC2r2, among others, including "wild-type" (i.e., 
normally encoded by the genome, naturally-occurring), modified, mutated versions as well as 
other fragments and derivatives thereof. Any of these targets may be suitable in practicing the 
present invention, either alone or in combination with one another or with other agents. 

The nucleic acid molecule may comprise or consist of a nucleotide sequence encoding 
one or more immunogenic targets, or fragments or derivatives thereof, such as that contained in a 
DNA insert in an ATCC Deposit The term Nucleic acid sequence" or "nucleic acid molecule" 
refers to a DNA or RNA sequence. The term encompasses molecules formed from any of the 
known base analogs of DNA and RNA such as, but not limited to 4-acetylcytosine, 8-hydroxy- 
N6-methyladenosine, aziridinyl-cytosine, pseudoisocytosine, 5-(carb6xyhydroxylmethyl) uracil, 
5-fluorouracil, 5-bromouracil, S^arboxyme^laminomethyl^-thiouracil, 5-carboxy- 
methylaminomethyluracil, dihydrouracil, inosine, N6-iso-pentenyladenine, 1-methyladenine, 1- 
methylpseudouracil, 1-mefoylguanine, 1-methylinosine, 2,2-dimethyl-guanine, 2-methyladenine, 
15 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-methyladenine, 7-methylguanine, 5- 
methylaminomethyluracil, 5-methoxyammo-methyl-2-thiouracil, beta-D-mannosylqueosine, 5' - 
methoxycarbonyl-methyluracil, 5-methoxyuracil, 2-memyltMo-N6-isopentenyladenine, uracil-5- 
oxyacetic acid methylester, uracil-5-oxyacetic acid, oxybutoxosine, pseudouracil, queosine, 2- 
thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracU, 5-methyluracil, N-uracil-5- 
.20 oxyacetic acid methylester, uracil-5-oxyacetic acid, pseudouracil, queosine, 2-thiocytosine, and 
2,6-diaminopurine, among others. 

An isolated nucleic acid molecule is one mat: (1) is separated from at least about 50 
percent of proteins, lipids, carbohydrates, or other materials with which it is naturally found 
when total nucleic acid is isolated from the source cells; (2) is not be linked to all or a portion of 
25 a polynucleotide to which the nucleic acid molecule is linked in nature; (3) is operably linked to 
a polynucleotide which it is not linked to in nature; and / or, (4) does not occur in nature as part 
of a larger polynucleotide sequence. Preferably, the isolated nucleic acid molecule of the present 
invention is substantially free from any other contaminating nucleic acid molecule(s) or other 
contaminants that are found in its natural environment that would interfere with its use in 
polypeptide production or its therapeutic, diagnostic, prophylactic or research use. As used 
herein, the term "naturally occurring" or "native" or "naturally found" when used in connection 
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with biological materials such as nucleic acid molecules, polypeptides, host cells, and the like, 
refers to materials which are found in nature and are not manipulated by man. Similarly, "non- 
naturally occurring" or "non-native" as used herein refers to a material that is not found in nature 
or that has been structurally modified or synthesized by man. 

5 The identity of two or more nucleic acid or amino acid sequences is determined by 

comparing the sequences. As known in the art, "identity" means the degree of sequence 
relatedness between nucleic acid or amino acid sequences as determined by the match between 
the units making up the molecules (Le., nucleotides or amino acid residues). Identity measures 
the percent of identical matches between the smaller of two or more sequences with gap 

10 alignments (if any) addressed by a particular mathematical model or computer program (i.e., an 
algorithm). Identity between nucleic acid sequences may also be determined by the ability of the 
nucleic acid sequences to hybridize to one another. In defining the process of hybridization, the 
term "highly stringent conditions" and "moderately stringent conditions" refer to conditions that 
permit hybridization of nucleic acid strands whose sequences are complementary, and to exclude 

15 hybridization of significantly mismatched nucleic acids. Examples of "highly stringent 
conditions" for hybridization and washing are 0.015 M sodium chloride, 0.0015 M sodium 
citrate at 65-68°C or 0.015 M sodium chloride, 0.0015 M sodium citrate, and 50% fonnamide at 
42°C. (see, for example, Sambrook, Fritsch & Maniatis, Molecular Cloning: A Laboratory 
Manual (2nd ed., Cold Spring Haibor Laboratory, 1989); Anderson et al y Nucleic Add 

20 Hybridisation: A Practical Approach Ch. 4 (IRL Press Limited)). The term "moderately 
stringent conditions" refers to conditions under which a DNA duplex with a greater degree of 
base pair- mismatching than could occur under "highly stringent conditions" is able to form. 
Exemplary moderately stringent conditions are 0.015 M sodium chloride, 0.0015 M sodium 
citrate at 50-65°C or 0.015 M sodium chloride, 0.0015 M sodium citrate, and 20% fonnamide at 

25 37-50°C. By way of example, moderately stringent conditions of 50°C in 0.015 M sodium ion 
will allow about a 21% mismatch. During hybridization, other agents may be included in the 
hybridization and washing buffers for the purpose of reducing non-specific and/or background 
hybridization. Examples are 0.1% bovine serum albumin, 0.1% polyvinylpyrrolidone, 0.1% 
sodium pyrophosphate, 0.1% sodium dodecylsulfate, NaDodS0 4 , (SDS), ficoll, Denhaidt's 

30 solution, sonicated salmon sperm DNA (or another non-complementary DNA), and dextran 
sulfate, although other suitable agents can also- be used. The concentration and types of these 
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additives can be changed without substantially affecting the stringency of the hybridization 
conditions.. Hybridization experiments are usually carried out at pH 6.8-7.4; however, at typical 
ionic strength conditions, the rate of hybridization is nearly independent of pH. 

In preferred embodiments of the present invention, vectors are used to transfer a nucleic 
5 acid sequence encoding an immunogenic target to a cell. A vector is any molecuie used to 
transfer a nucleic acid sequence to a host cell. In certain cases, an expression vector is utilized. 
An expression vector is a nucleic acid molecule that is suitable for transformation of a host cell 
and contains nucleic acid sequences that direct and / or control the expression of the transferred 
nucleic acid sequences. Expression includes, but is not limited to, processes, such as 

10 transcription, translation, and splicing, if introns are present. Expression vectors typically 
comprise one or more flanking sequences operably linked to a heterologous nucleic acid 
sequence encoding a polypeptide. Flanking sequences may be homologous (i.e., from the same 
species and / or strain as the host cell), heterologous (i.e., from a species other than the host cell 
species or strain), hybrid (i.e., a combination of flanking sequences from more than one source), 

.15 or synthetic, for example. 

A flanking sequence is preferably capable of effecting the replication, transcription and / 
or translation of the coding sequence and is operably linked to a coding sequence. As used 
herein, the term operably linked refers to a linkage of polynucleotide elements in a functional 
relationship. For instance, a promoter or enhancer is operably linked to a coding sequence if it 

. 20 affects the transcription of the coding sequence. However, a flanking sequence need not 
necessarily be contiguous with the coding sequence, so long as it functions correctly. Thus, for 
example, intervening untranslated yet transcribed sequences can be present between a promoter 
sequence and the coding sequence and the promoter sequence may still be. considered operably 
linked to the coding sequence. Similarly, an enhancer sequence may be located upstream or 

25 downstream from the coding sequence and affect transcription of the sequence. 

In certain embodiments, it is preferred that the flanking sequence is a transcriptional 

regulatory region that drives high-level gene expression in the target cell. The transcriptional 

regulatory region may comprise, for example, a promoter, enhancer, silencer, repressor element, 

or combinations thereof. The transcriptional regulatory region may be either constitutive, tissue- 

30 specific, cell-type specific (i.e., the region is drives higher levels of transcription in a one type of 

tissue or cell as compared to another), or regulatable (i.e., responsive to interaction with a 

8 
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compound such as tetracycline). The source of a transcriptional regulatory region may be any 
prokaryotic or eukaryotic organism, any vertebrate or invertebrate organism, or any plant, 
provided that the flanking sequence functions in a cell by causing transcription of a nucleic acid 
within that cell. A wide variety of transcriptional regulatory regions may be utilized in practicing 
5 the present invention. 

Suitable transcriptional regulatory regions include the CMV promoter (i.e., the CMV- 
immediate early promoter); promoters from eukaryotic genes (i.e., the estrogen-inducible 
chicken ovalbumin gene, the interferon genes, the gluco-corticoid-inducible tyrosine 
aminotransferase gene, and the thymidine kinase gene); and the major early and late adenovirus 
10 gene promoters; the SV40 eariy promoter region (Bernoist and Chambon, 1981, Nature 290:304- 
10); the promoter contained in the 3Mong terminal repeat (LTR) of Rous sarcoma virus (RSV) 
(Yamamoto, et al, 1980, Cell 22:787-97); the herpes simplex virus thymidine kinase (HSV-TK) 
promoter (Wagner et al, 1981, Proc. Natl. Acad. Set U.SA. 78:1444-45); the regulatory 
sequences of the metallothionine gene (Brinster et al, 1982, Nature 296:39-42); prokaryotic 
15 expression vectors such as the beta-lactamase promoter (Villa-Kamaroff et al, 1978, Proc. Natl 
Acad. Set USA., 75:3727-31); or the tac promoter (DeBoer et al, 1983, Proc. Natl Acad. Sci. 
U.SA., 80:21-25). Tissue- and / or cell-type specific transcriptional control regions include, for 
example, the elastase I gene control region which is active in pancreatic acinar cells (Swift et al, 
1984, Cell 38:639-46; Ornitz et al, 1986, Cold Spring Harbor Symp. Quant. Biol 50:399-409 
20 (1986); MacDonald, 1987, Hepatology 7:425-515); the insulin gene control region which is 
active in pancreatic beta cells (Hanahan, 1985, Nature .315:1 15-22); the immunoglobulin gene, 
control region which is active in lymphoid cells (Grosschedl et al, 1984, Cell 38:647-58; 
Adames et al, 1985, Nature 318:533-38; Alexander et aU 1987, Mol Cell Biol, 7:1436-44); the 
mouse mammary tumor virus control region in testicular, breast, lymphoid and mast cells {Leder 
25 etal, 1986, Cell 45:485-95); the albumin gene control region in liver (Pinkert e*a/., 1987, Genes 
and Devel 1:268-76); the alpha-feto-protein gene control region in liver (Krumlauf et al, 1985, 
Mol Cell Biol, 5:1639-48; Hammer et al, 1987, Science 235:53-58); the alpha 1-antitrypsin 
gene control region in liver (Kelsey et al, 1987, Genes and Devel 1:161-71); the beta-globin 
.gene control region in myeloid cells (Mogram et al, 1985, Nature 315:338-40; Kollias et al, 
30 1 986, Cell 46:89-94); the myelin basic protein gene control region in oligodendrocyte cells in the 

brain (Readhead et al, 1987, Cell 48:703-12); the myosin light chain-2 gene control region in 

9 



WO 2005/026370 PCT/US2004/028751 



skeletal muscle (Sani, 1985, Nature 314:283-86); the gonadotropic releasing hormone gene 
control region in the hypothalamus (Mason et a/.„ 1986, Science 234:1372-78), and the 
tyrosinase promoter in melanoma cells (Hart, I. Sernin Oncol 1996Feb;23(l):154-8; Siders, etal. 
Cancer Gene Ther 1998 Sep-Oct;5(5):281-9i), among others. Inducible promoters that are 
5 activated in the presence of a. certain compound or condition such as light, heat, radiation, 
tetracycline, or heat shock proteins, for example, may also be utilized (see, for example, WO 
00/10612). Other suitable promoters are known in the art. 

As described above, enhancers may also be suitable flanking sequences. Enhancers are 
cis-acting elements of DNA, usually about 10-300 bp in length, that act on the promoter to 
10 increase transcription. Enhancers are typically orientation- and position-independent, having 
been identified both 5' and 3' to controlled coding sequences. Several enhancer sequences 
available from mammalian genes are known (i.e., globin, diastase, albumin, alpha-feto-protein 
and insulin). Similarly, the SV40 enhancer, the cytomegalovirus early promoter enhancer, the 
polyoma enhancer, and adenovirus enhancers are useful with eukaryotic promoter sequences. 
15 While an enhancer may be spliced into the vector at a position 5' or 3' to nucleic acid coding 
sequence, it is typically located at a site 5' from the promoter. Other suitable enhancers are 
known in the art, and would be applicable to the present invention. 

While preparing reagents of the present invention, cells may need to be transfected or 
transformed. Transfection refers to the uptake of foreign or exogenous DNA by a cell, and a cell 
20 has been transfected when the exogenous DNA has been introduced inside the cell membrane. A 
number of transfection techniques are well known in the art (i.e., Graham et al, 1973, Virology 
52:456; Sambrook et al, Molecular Cloning, A Laboratory Manual (Cold Spring Harbor 
Laboratories, 1989);.Davis et al., Basic Methods in Molecular Biology (Elsevier, 1986); and Chu 
et al, 1981, Gene 13:197). Such techniques can be used to introduce one or more exogenous 
25 DNA moieties into suitable host cells. 

In certain embodiments, it is preferred that transfection of a cell results in transformation 
of that cell. A cell is transformed when there is a change in a characteristic of the cell, being 
transformed when it has been modified to contain a new nucleic acid. Following transfection, 
the transfected nucleic acid may recombine with that of the cell by physically integrating into a 
30 chromosome of the cell, may be maintained transiently as an episomal element without being 
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replicated, or may replicate independently as a plasmid. A cell is stably transformed when the 
nucleic acid is replicated with the division of the cell. 

The expression vectors of the present invention also provide for expression of fragments 
of immunogenic targets. Fragments may include sequences truncated at the amino terminus 
5 (with or without a leader sequence) and / or the carboxy tenninus. Fragments may also include 
variants (i.e., allelic, splice), orfhologs, homologues, and other variants having one or more 
amino acid additions or substitutions or internal deletions as compared to the parental sequence. 
In preferred embodiments, truncations and/or deletions comprise about 1-5 amino acids, 5-10 
amino acids, 10-20 amino acids, 20-30 amino acids, 30-40 amino acids, 40-50 amino acids, or 

10 more. Such polypeptide fragments may optionally comprise an amino terminal methionine 
residue. It will be appreciated that such fragments can be used, for example, to generate 
antibodies or cellular immune responses to immunogenic targets. 

A variant is a sequence having one or more sequence substitutions, deletions, and/or 
additions as compared to the subject sequence. Variants may be naturally occurring or 

15 artificially constructed. Such variants may be prepared from the corresponding nucleic acid 
molecules. In preferred embodiments, the variants have from 1 to 3, or from 1 to 5, or from 1 to 
10, or from 1 to 15, or from 1 to 20, or from 1 to 25, or from 1 to 30, or from 1 to 40, or from 1 
to 50, or more than 50 amino acid substitutions, insertions, additions and/or deletions. 

An allelic variant is one of several possible naturally-occurring alternate forms of a 

20 sequence occupying a given locus on a chromosome of an organism or a population of 
organisms. A splice variant is a polypeptide generated from one of several RNA. transcript 
resulting from splicing of a primary transcript. An orfholog is a similar nucleic acid or 
polypeptide sequence from another species. For example, the mouse and human versions of an 
immunogenic target , may be considered orthologs of each other. A derivative of a sequence is 

25 one that is derived from a parental sequence those sequences having substitutions, additions, 
deletions, or chemically modified variants. Variants may also include fusion proteins, which 
refers to the fusion of one or more first sequences (such as a peptide) at the amino or carboxy 
terminus of at least one other sequence (such as a heterologous peptide). 

''Similarity" is a concept related to identity, except that similarity refers to a measure of 

30 relatedness which includes both identical matches and conservative substitution matches. If two 
polypeptide sequences have, for example, 10/20 identical amino acids, and the remainder are all 
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non^onservative substitutions, then the percent identity and similarity would both be 50%. If in 
the same example, there are five more positions where there are conservative substitutions, then 
the percent identity remains 50%, but the percent similarity would be 75% (15/20). Therefore, in 
cases where there are conservative substitutions, the percent similarity between two polypeptides 
will be higher than the percent identity between those two polypeptides. 

Substitutions may be conservative, or non-conservative, or any combination thereof. 
Conservative amino acid modifications to the sequence of a polypeptide (and the corresponding 
modifications to the encoding nucleotides) may produce polypeptides having functional and 
chemical characteristics similar to those of a parental polypeptide. For example, a "conservative 
amino acid substitution" may involve a substitution of a native amino acid residue with a non- 
native residue such that there is little or no effect on the size, polarity, charge, hydrophobicity, or 
hydrophilicity of the amino acid residue at that position and, in particlar, does not result in 
decreased immunogenicity. Suitable conservative amino acid substitutions are shown in Table I. 



15 



Table I 



Original 
Residues 


Exemplary Substitutions 


Preferred 
Substitutions 


Ala 


Val, Leu, lie 


Val 


Arg 


Lys, Gin, Asn 


Lys 


Asn 


Gin 


Gin 


Asp 


Glu 


Glu 


Cys 


Ser, Ala 


Ser 


Gin 


Asn 


Asn 


Glu 


Asp 


Asp 


Gly 


Pro, Ala 


Ala 


His 


Asn, Gin, Lys, Arg 


Arg 


lie 


Leu, Val, Met, Ala, Phe, Norleucine 


Leu 


Leu 


Norleucine, He, Val, Met, Ala, Phe 


lie 


Lys 


Arg, 1,4 Diamino-butyric Acid, Gin, Asn 


Arg 


Met 


Leu, Phe, lie 


Leu 


Phe 


Leu, Val, He, Ala, Tyr 


Leu 


Pro 


Ala 


Gly 


Ser 


Thr, Ala, Cys 


Thr 


Thr 


Ser 


Ser 


Trp 


Tyr, Phe 


Tyr 


Tyr 


Trp, Phe ; Thr, Sa- 


Phe 


Val 


lle, Met, Leu, Phe, Ala, Norleucine 


Leu 



12 



WO 2005/026370 



PCT/US2004/028751 



A skilled artisan will be able to determine suitable variants of an immunogenic target 
using well-known techniques. For identifying suitable areas of the molecule that may be 
changed without destroying biological activity (i.e., MHC binding, immunogenicity), one skilled 
in the art may target areas not believed to be important for that activity. For example, when 
immunogenic targets with similar activities from the same species or from, other species are 
known, one skilled in the art may compare the amino acid sequence of a polypeptide to such 
similar polypeptides. By performing such analyses, one can identify residues and portions of the 
molecules that are conserved. It will be appreciated that changes in areas of. the molecule that 
are not conserved relative to such similar immunogenic targets would be less likely to adversely 
affect the biological activity and/or structure of a polypeptide. Similarly, the residues required 
for binding to MHC are known, and may be modified to improve binding. However, 
modifications resulting in decreased binding to MHC will not be appropriate in most situations. 
One skilled in the art would also know that, even in relatively conserved regions, one may 
substitute chemically similar amino acids for the naturally occurring residues while retaining 
activity. Therefore, even areas that may be important for biological activity or for structure may 
be subject to conservative amino acid substitutions without destroying the biological activity or 
without adversely affecting the structure of the immunogenic target. 

Other preferred polypeptide variants include glycosylation variants wherein the number 
and/or type of glycosylation sites have been altered compared to the subject amino acid 
sequence. In one embodiment, polypeptide variants comprise a greater or a lesser number of N- 
linked glycosylation sites than the subject amino acid sequence. An N-linked glycosylation site 
is characterized by the sequence Asn-X-Ser or Asn-X-Thr, wherein the. amino acid residue 
designated as X may be any amino acid residue except proline. The substitution of amino acid 
residues to create this sequence provides a potential new site for the addition of an N-linked 
carbohydrate chain. Alternatively, substitutions that eliminate .this sequence will remove an 
existing N-linked carbohydrate chain. Also provided is a rearrangement of N-linked 
carbohydrate chains wherein one or more N-linked glycosylation sites (typically those that are 
naturally occurring) are eliminated and one or more new N-linked sites are created. To afFect O- 
linked glycosylation of a polypeptide, one would modify serine and / or threonine residues. 
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Additional preferred variants include cysteine variants, wherein one or more cysteine 
residues are deleted or substituted with another amino acid (e.g., serine) as compared to the 
subject amino acid sequence set. Cysteine variants are useful when peptides or polypeptides 
must be refolded into a biologically active conformation such as after the isolation of insoluble 
inclusion bodies. Cysteine variants generally have fewer cysteine residues than the native 
protein, and typically have an even number to minimize interactions resulting from unpaired 
cysteines. 

In other embodiments, the peptides or polypeptides may be attached to one or more 
fusion segments that assist in purification of the polypeptides. Fusions can be made either at the 
amino terminus or at the carboxy terminus of the subject polypeptide variant thereof. Fusions 
may be direct with no linker or adapter molecule or may be through alinker or adapter molecule. 
A linker or adapter molecule may be one or more amino acid residues, typically from about 20 to 
about 50 amino acid residues. A linker or adapter molecule may also be designed with a 
cleavage site for a DNA restriction endonuclease or for a protease to allow for the separation of 
the fused moieties. It will be appreciated that once constructedi the fusion polypeptides can be 
derivauzed according to the methods described herein. Suitable fusion segments include, among 
otheis, metal binding domains (e.g., apoly^histidine segment), immunoglobulin binding domains 
(i.e., Protein A, Protein G, T cell, B cell, Fc receptor, or complement protein antibody-binding 
domains), sugar binding domains (e.g., a maltose binding domain), and/or a "tag" domain (i.e., at 
least a portion of a-galactosidase, a strep tag peptide, a T7 tag peptide, a FLAG peptide, or other 
domains that can be purified using compounds that bind to the domain, such as monoclonal 
antibodies). This tag is typically fused to the peptide or polypeptide and upon expression may 
serve as a means for affinity purification of the sequence of interest polypeptide from the host 
cell. Affinity purification can be accomplished, for example, by column chromatography using 
antibodies against the tag as an affinity matrix. Optionally, the tag can subsequently be removed 
from the purified sequence of interest polypeptide by various means such as using certain 
peptidases for cleavage.. As described below, fusions may also be made between a TA and a co- 
stimulatory components such as the chemokines CXC10 (IP-10), CCL7 (MCP-3), or CCL5 
(RANTES), for example. 

A fusion motif may enhance transport of an immunogenic target to an MHC processing 
compartment, such as the endoplasmic reticulum. These sequences, referred to as tranduction or 
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transcytosis sequences, include sequences derived from HIV tat (see Kim et al. 1997 J. Immunol. 
\59:l666),.Drosophila antennapedia (see Schutze-Redelmeier et al. 1996 J. Immunol. 157:650), 
or human period-1 protein (hPERl; in particular, SRRHHCRSKAKRSRHH). 

In addition, the polypeptide or variant thereof may be fused to a homologous peptide or 
5 polypeptide to form a homodimer or to a heterologous peptide or polypeptide to form a 
heterodimer. Heterologous peptides and polypeptides include, but are not limited to an epitope 
to allow for the detection and/or isolation of a fusion polypeptide; a transmembrane receptor 
protein or a portion thereof, such as an extracellular domain or a transmembrane and intracellular 
domain; a ligand or a portion thereof which binds to a transmembrane receptor protein; an 
10 enzyme or portion thereof which is catalytically active; a polypeptide or peptide which promotes 
oligomerization, such as a- leucine zipper domain; a polypeptide or peptide which increases 
stability, such as an immunoglobuUn constant region; a peptide or polypeptide which has a 
therapeutic activity different from the peptide or polypeptide; and/or variants thereof. 

In certain embodiments, it may be advantageous to combine a nucleic acid sequence 
15 encoding an immunogenic target with one or more co-stimulatory components) such as cell 
surface proteins, cytokines or chemokines in a composition of the present invention. The co- 
stimulatory component may be included in the composition as a polypeptide or as a nucleic acid 
encoding the polypeptide, for example. Suitable co-stimulatory molecules include, for instance, 
polypeptides that bind members of the CD28 family (i.e., CD28, ICOS; Hutloff, et al. Nature 
20 1999, 397: 263-265; Peach, et al. J Exp Med 1994, 180: 2049-2058) such as the CD28 binding 
polypeptides B7.1 (CD80; Schwartz, 1992; Chen et al; 1992; Ellis, et al. J. Immunol., 156(8): 
2700-9), B7.2 (CD86; Ellis, et al. J. Immunol, 156(8): 2700-9), and mutants / variants hereof 
(WO 00/66162); polypeptides which bind members of the integrin family (i.e., LFA-1 (CD1 la / 
CD18); Sedwick, et.al. J Immunol 1999, 162: 1367-1375; Wulfing, et al. Science 1998, 282: 
25 2266-2269; Lub, et al. Immunol Today 1995, 16: 479-483) including members of the ICAM 
family (i.e., ICAM-1, -2 or -3); polypeptides which bind CD2 family members (i;e., CD2, 
signalling lymphocyte activation molecule (CDwlSO or "SLAM"; Aversa, et al. 
J Immunol 1997, 158: 4036-4044)) such as CD58 (LFA-3; CD2 ligand; Davis, et al: Immunol 
Today 1996, 17: 177-187) or SLAM ligands (Sayos, et al. Nature 1998, 395: 462-469); 
30 polypeptides which bind heat stable antigen (HSA or CD24; Zhou, et al. Eur J Immunol 1997, 

27: 2524-2528); polypeptides which bind to members of the TNF receptor (TNFR) family (i.e., 
■ ' ' 15 
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4-1BB (CD137; Vinay, et al. Semin Immunol 1998, 10: 481-489), OX40 (CD134; Weinberg, et 
al. Semin Immunol 1998, 10: 471-480; Higgins, et al. J Immunol 1999, 162: 48<M93), and 
CD27 (Lens, et al. Semin Immunol 1998, 10: 491^99)) such as 4-1BBL (4-1BB ligand; Vinay, 
et al. Semin Immunol 1998, 10: 48M8; DeBenedette, et al. J Immunol 1997, 158: 551-559), 
5 TNFR associated factor-1 (TRAF-1; 4-1BB ligand; Saoull'i, et al. J Exp Med 1998, 187: 1849- 
1862, Arch, et al. Mol Cell Biol 1998, 18: 558-565), TRAF-2 (4-1BB and OX40 ligand; SaoulU, 
et al. J Exp Med 1998, 187: 1849-1862; Oshima, et al. Int Immunol 1998, 10: 517-526,. 
Kawamata, et al. J Biol Chem 1998, 273: 5808-5814), TRAF-3 (4-lBB.and OX40 ligand; Arch, 
et al. . Mol Cell Biol 1998, 18: 558-565; Jang, et al. Biochem Biophys Res Commun 1998, 242: 
10 613-620; Kawamata S, et al. J Biol Chem 1998, 273: 5808-5814), OX40L (OX40 ligand; 
Gramaglia, et al. J Immunol 1998, 161: 6510-6517), TRAF-5 (OX40 ligand; Arch, et al. Mol 
Cell Biol 1998, 18: 558-565; Kawamata, et al. J Biol Chem 1998, 273: 5808-5814), and CD70 
(CD27 ligand; Couderc, et al. Cancer Gene Ther., 5(3): 163-75). CD154 (CD40 ligand or 
"CD40L"; Gurunathan, et al. /. Immunol., 1998, 161: 4563-4571; Sine, etal. Hum. Gene Ther., 
.15 2001, 12: 1091-1 102) may also be suitable. 

One or more cytokines may also be suitable co-stimulatory components or "adjuvants", 
either as polypeptides or being encoded by nucleic acids contained within the compositions of 
the present invention (Parmiani, et al. Immunol Lett 2000 Sep 15; 74(1): 41-4; Berzofsky, et al. 
Nature Immunol. 1: 209-219). Suitable cytokines include, for example, interieukin-2 (IL-2) 
. 20 (Rosenberg, et al. Nature Med. 4: 321-327 (1998)), IL-4, IL-7, IL-12 (reviewed by Pardoll, 
1992; Harries, et al. J. Gene Med 2000 Jul-Aug;2(4):243-9; Rao, et al. J: Immunol. 156: 3357- 
3365 (1996)), IL-15 (Xin, et al. Vaccine, 17:858-866, 1999), IL-16 (Cruikshank, et al. J. Leuk 
Biol. 67(6): 757-66, 2000), IL-1 8 (J. Cancer Res. Clin. Oncol. 2001! 127(12): 718-726), GM- 
CSF (CSF (Disis, et al. Blood, 88: 202-210 (1996)), tumor necrosis factor-alpha (TNF-ct), or 
25 interferons such as IFN-a or INF-y. Other cytokines may also be suitable for practicing the 
present invention, as is known in the art. 

Chemokines may also be utilized, in either polypeptide or nucleic acid form. Fusion 
proteins comprising CXCL10 (IP-10) and CCL7 (MCP-3) fused to a tumor self-antigen have 
been shown to induce, anti-tumor immunity (Biragyn, et al. Nature Biotech. 1999, 17: 253-258). 
30 The chemokines CCL3 (MIP-la) and CCL5 (RANTES) (Boyer, et al: Vaccine, 1999, 17 (Supp. 
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2): S53-S64) may also be of use in practicing the present invention. Other suitable chemokines 
are known in the art. 

It is also known in the art that suppressive or negative regulatory immune mechanisms 
may be blocked, resulting in enhanced immune responses. For instance, treatment with anti- 
5 CTLA-4 (Shrikant, et al. Immunity, 1996, 14: 145-155; Sutmuller, et al. /. Exp. Med., 2001, 
194: 823-832), anti-CD25 (Sutmuller, supra), anti-CD4 (Matsui, et al. J. Immunol, 1999, 163: 
184-193), the fusion protein IL13Ra2-Fc (Terabe, et al. Nature Immunol., 2000, 1: 515-520), 
and combinations thereof (i.e., anti-CTLA-4 and anti-CD25, Sutmuller, supra) have been shown 
to upregulate anti-tumor immune responses and would be suitable in practicing the present 
10 invention. Such treatments, among others, , may also be combined with the one or more 
immunogenic targets of the present invention. 

Any of these components may be used alone or in combination with other agents. For 
instance, it has been shown that a combination of CD80, ICAM-1 and LFA-3 ("TRICOM") may 
potentiate anti-cancer immune responses (Hodge, et al. Cancer Res. 59: 5800-5807 (1999). 
15 Other effective combinations include, for example, IL-12 + GM-CSF (Ahlers, et al. J. Immunol, 
158: 3947-3958 (1997); Iwasaki, et al. /. Immunol. 158: 4591-4601 (1997)), IL-12 + GM-CSF + 
TNF-a (Ahlers, et al. Int. Immunol 13: 897-908 (2001)), CD80 + IL-12 (Fruend, et al. Int. J. 
Cancer, 85: 508-517 (2000); Rao, et al. supra), and CD86 + GM-CSF + IL-12 (Iwasaki, supra). 
One of skill in the art would be aware of additional combinations useful in carrying out the 
20 present invention. In addition, the skilled artisan would be aware of additional reagents or 
methods that may be used to modulate such mechanisms. These reagents and methods, as well 
as others known by those of skill in the art, may be utilized in practicing the present invention. 

Additional strategies for improving the efficiency of nucleic acid-based immunization 
may also be used including, for example, the use of self-replicating viral replicons (Caley, et al. 
25 1999. Vaccine, 17: 3124-2135; Dubensky, et al. 2000. Mol. Med 6: 723-732; Leitner, ct al. 
2000. Cancer Res. 60: 51-55), codon optimization (Liu, et al. 2000. Mol Tlier., 1: 497-500; 
Dubensky, supra; Huang, et al. 2001. J. Virol 75: 4947-4951), in vivo electroporation (Widera, 
et al. 2000. /. Immunol 164: 4635-3640), incorporation of CpG stimulatory motifs 
(Gurunathan, et al. Ann. Rev. Immunol, 2000, 18: 927-974; Leitner, supra; Cho, et al. J. 
30 Immunol. 168(10):4907-13), sequences for targeting of the endocytic or ubiquitin-processing 
pathways (Thomson, et al. 1998. J. Virol 72: 2246-2252; Velders, et al. 2001. J. Immunol 
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166: 5366-5373), Marek's disease virus type 1 VP22 sequences (J. Virol. 76(6):2676-82, 2002), 
prime-boost regimens (Gurunathan, supra) Sullivan, et al. 2000. Nature, 408: 605-609; Hanke, 
et al. 1998. Vaccine, 16: 439-445; Amara, et al. 2001. Science, 292: 69-74), and the use of 
mucosal delivery vectors such as Salmonella. (Darji, et al. 1997. Cell, 91: 765-775; Woo, et al. 
5 2001. Vaccine, 19: 2945-2954). Other methods are known in the art, some of which are 
described below. 

. Chemotherapeutic agents, radiation, anti-angiogenic compounds, or other agents may. 
also be utilized in treating and / or preventing cancer using immunogenic targets (Sebti, et al. 
Oncogene 2000 Dec 27;1 9(56):6566-73). For example, in treating metastatic melanoma, suitable 

10 chemotherapeutic regimens may include BELD (bleomycin, vindesine, lomustine, and 
deacarbazine; Young, et al. 1985. Cancer, 55: 1879-81), BOLD (bleomycin, vincristine, 
lomustine, dacarbazine; Seigler, et al: 1980. Cancer, 46 : 2346-8); DD (dacarbazine, 
actinomycin; Hochster, et al. Cancer Treatment Reports, 69: 39-42), or POC (procarbazine, 
vincristine, lomustine; Carmo-Pereira, et al. 1984. Cancer Treatment Reports, 68: 1211-4) 

15 among others. Other suitable chemotherapeutic regimens may also be utilized. 

Many anti-angiogenic agents are known in the art and would be suitable for co- 
administration with the immunogenic target vaccines and/or chemotherapeutic regimens (see, for 
example, Timar, et al. 2001. Pathology Oncol. Res., 7(2): 85-94). Such agents include, for 
example, physiological agents such as growth factors (i.e., ANG-2, NKl,2,4 (HGF), 
,20 transforming growth factor beta (TGF-p)), cytokines (i.e., interferons such as IFN-ct, -p,.-Y, 
platelet factor 4 (PF-4), PR-39), proteases (i.e., cleaved AT-ffl, collagen XVIII fragment 
(Endostatin)), HmwKallikrein-d5 plasmin fragment (Angiostatin), prothroinbin-Fl-2, TSP-1), 
protease inhibitors (i.e., tissue inhibitor of metalloproteases such as TIMP-1, -2, or -3; maspin; 
plasminogen activator-inhibitors such as PAI-1; pigment epithelium derived factor (PEDF)), 

25 Tumstatin (available through ILEX, Inc.), antibody products (i.e., the collagen-binding 
antibodies HUIV26, HUI77, XL313; anti-VEGF; anti-integrin. (i.e., Vitaxin, {Lxsys))), and 
glycosidases (i.e., heparinase-I, -III). "Chemical" or modified physiological agents known or 
believed to have anti-angiogenic potential include, for example, vinblastine, taxol, ketoconazole, 
thalidomide, dolestatin, combrestatin A, rapamycin (Guba, et al. 2002, Nature Med., 8: 128- 

30 135), CEP-7055 (available from Cephalon, Inc.), flavone acetic acid, Bay 12-9566 (Bayer 
Corp.), AG3340°(Ag°u ron > Inc.), CGS 27023A (Novartis), tetracylcine derivatives (i.e., COL-3 
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(Collagenix, Inc.)), Neovastat (Aeterna), BMS-275291 (Bristol-Myers Squibb), low dose 5-FU, 

low dose methotrexate (MTX), irsofladine, radicicol, cyclosporin, captopril, celecoxib, 

D45152-sulphated polysaccharide, cationic protein (Protamine), cationic peptide-VEGF, 

Suramin (polysulphonated napthyl urea), compounds that interfere with the function or 

5 production of VEGF (i.e., SU5416 or SU6668 (Sugen), PTK787/ZK22584 (Novartis)), 

Distamycin A, Angiozyme (ribozyme), isoflavinoids, staurosporine derivatives, genistein, 

EMD121974 (Merck KcgaA), tyrphostins, isoquinolones, retinoic acid, carboxyamidotriazole, 

TNP-470, octreotide, 2-methoxyestradiol, aminosterols (i.e., squalamine), glutathione analogues 

(i.e., N-acteyl-L-cysteine), combretastatin A-4 (Oxigene), Eph receptor blocking agents {Nature, 

10 414:933-938, 2001), Rh-Angiostatin, Rh-Endostatin (WO 01/93897), cyclic-RGD peptide, 

accutin-disintegrin, benzodiazepenes," humanized anti-avb3 Ab, Rh-PAI-2, amilpride, p- 

amidobenzamidine, anti-uPA ab, anti-uPAR Ab, L-phanylalanin-N-methylamides (i.e., 

Batimistat, Marimastat), AG3340, and minocycline! Many other suitable agents are known in 

the art and would suffice in practicing the present invention. 

15 The present invention may also be utilized in combination with "non-traditional" 

methods of treating cancer. For example, it has recently been demonstrated that administration 

of certain anaerobic bacteria may assist in slowing tumor growth. In one study, Clostridium 

novyi was modified to eliminate a toxin gene carried on a phage episome and administered to 

mice with colorectal tumore (Dang, et al. RNA.S. USA, 98(26): 15155-15160, 2001). In 

20 combination with chemotherapy, the treatment was shown to cause tumor necrosis in the 

animals. The reagents and methodologies described in . this application may be combined with 

such treatment methodologies. 

Nucleic acids encoding immunogenic targets may be administered to patients by any of 

several available techniques. Various viral vectors that have been successfully utilized for 

25 introducing a nucleic acid to a host include retrovirus, adenovirus, adeno-associated virus 

(AAV), herpes virus, and poxvirus, among others. It is understood in the art that many such viral 

vectors are available in the art. The vectors of the present invention may be constructed using 

standard recombinant techniques widely available to one skilled in the art. Such techniques may 

be found in common molecular biology references such as Molecular Cloning: A Laboratory 

30 Manual (Sambrook, et al., 1989, Cold Spring Harbor Laboratory Press), Gene Expression 

Technology (Methods in Enzymology, Vol. 185, edited by D. Goeddel, 1991. Academic Press, 
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San Diego, CA), and PCR Protocols: A Guide to Methods and Applications (Ihnis, et al. 1990. 

Academic Press, San Diego, CA). 

Preferred retroviral vectors are derivatives of lentivirus as well as derivatives of murine 
or avian retroviruses. Examples of suitable retroviral vectors include, for example, Moloney 
murine leukemia virus (MoMuLV), Harvey murine sarcoma virus (HaMuSV), murine mammary 
tumor virus (MuMTV), SIV, BIV, HIV and Rous Sarcoma Virus (RSV). A number of retroviral 
vectors can incorporate multiple exogenous nucleic acid sequences. As recombinant retroviruses 
are defective, they require assistance in order to produce infectious vector particles. This 
assistance can be provided by, for example, helper cell lines encoding retrovirus structural genes. 
Suitable helper cell lines include ¥2, PA317 and PA12, among others; The vector virions 
produced using' such cell lines may then be used to infect a tissue .cell line, such as NIH 3T3 
cells, to produce large quantities of chimeric retroviral virions. Retroviral vectors may be 
administered by traditional methods (i.e., injection) or by implantation of a "producer cell line" 
in proximity to the target cell population (Culver, K, et al., .1994, Hum. Gene Ther., 5 (3): 343- 
79; Culver, KL, et al., Cold Spring Harb. Symp. Quant. Biol., 59: 685-90); Oldfield, E.,1993, 
Hum. Gene Ther., 4 (I): 39-69). The producer cell line is engineered to produce a viral vector 
and releases viral particles in the vicinity of the target cell. A portion of the released viral 
particles contact the target cells and infect those cells, thus delivering a nucleic acid of the 
present invention to the target cell. Following infection of the target cell, expression of the 
nucleic acid of the vector occurs. 

Adenoviral vectors have proven especially useful for gene transfer into eukaryotic cells 
(Rosenfeld, M., et al., 1991, Science, 252 (5004): 431-4; Crystal, R., et al., 1994, Nat Genet., 8 
(1): 42-51), the study eukaryotic gene expression (Levrero, M., et al., 1991, Gene, 101 (2): 195- 
202), vaccine development (Graham, F. and Preyec, L., 1992, Biotechnology, 20: 363-90), and in 
animal models (Stratford-Perricaudet, L., et al., 1992, Bone Marrow Transplant., 9 (Suppl. .1): 
151-2 ; Rich, D., et al, 1993, Hum. Gene Ther., 4 (4): 461-76). Experimental routes for 
administrating recombinant Ad to different tissues in vivo have included intratracheal instillation 
(Rosenfeld, M., et al.. 1992, Cell, 68 (1): 143-55) injection into muscle (Quantin, B., et al., 1992, 
Proc. Natl. Acad. Sci. U.S.A., 89 (7): 2581-4), peripheral intravenous injection (Here, J., and 
Gerard, R., 1993, Proc. Natl. Acad. Sci. U.SA., 90 (7): 2812-6) and stereotactic inoculation to 

brain (Le Gal La Salle, G., et al., 1993, Science, 259 (5097): 988-90), among others. 
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Adeno-associated virus (AAV) demonstrates high-level infectivity, broad host range and 
specificity in integrating into the host cell genome (Hermonat, P., et al., 1984, Proc. Natl. Acad. 
Sci. U.S.A., 81 (20): 6466-70). And Herpes Simplex Virus type-1 (HSV-1) is yet another 
attractive vector system, especially for use in the nervous system because of its neurotropic 

5 property (Geller, A., et al, 1991, Trends Newosci., 14 (10): 428-32; Glorioso, et al, 1995, Mol 
Biotechnol, 4 (1): 87-99; Glorioso, et al, 1995, Arum Rev. Microbiol, 49: 675-710). 

Poxvirus is another useful expression vector (Smith, et al. 1983, Gene, 25 (1): 21-8; 
Moss, et al, 1992, Biotechnology, 20: 345-62; Moss, et al, 1992, Curr. Top. Microbiol. Immunol, 
158: 25-38; Moss, et al. 1991. Science, 252: 1662-1667). Poxviruses shown to be useful 

10 include vaccinia, NYVAC, avipox, fowlpox,. canarypox, ALVAC, and ALVAC(2), among 

■ . others. 

NYVAC (vP866) was derived from the Copenhagen vaccine strain of vaccinia virus by 
deleting six nonessential regions of the genome encoding known or potential virulence factors 
(see, for example, U.S. Pat. Nos. 5,364,773 and. 5,494,807). The deletion loci were also 

15 engineered as recipient loci for the insertion of foreign genes. The deleted regions are: 
thymidine kinase gene (TK; J2R); hemorrhagic region (u; B13R+B14R); A type inclusion body 
region (ATI; A26L); hemagglutinin gene (HA; A56R); host range gene region (C7L-K1L); and, 
large subunit, ribonucleotide reductase (I4L). NYVAC is a genetically engineered vaccinia virus 
strain that was generated by the specific deletion of eighteen open reading frames encoding gene 

20 products associated with virulence and host range. NYVAC has been show to be useful for 
expressing TAs (see, for example, U.S. Pat. No. 6,265, 1 89). NYVAC (yP866), vP994, vCP205, 
vCP1433, placZH6H4Lreverse, pMPC6H6K3E3 and pC3H6FHVB were also deposited with the 
ATCC under the terms of the Budapest Treaty, accession numbers VR-2559, VR-2558, VR- 
2557, VR-2556, ATCC-97913, ATCC-97912, and ATCC-97914, respectively. 

25 ALVAC-based recombinant viruses (i.e., ALVAC-1 and ALVAC-2) are also suitable for 

use in practicing the present invention (see, for example, U.S. Pat. No. 5,756,103). ALVAC(2) 
is identical to ALVAC(l) except that ALVAC(2) genome comprises the vaccinia E3L and K3L 
genes under the control of vaccinia promoters (U.S. Pat. No. 6,130,066; Beattie et al., 1995a, 
1995b, 1991; Chang et al., 1992; Davies et al., 1993). Both ALVAC(l) and ALVAQ2) have 

30 been demonstrated to be useful in expressing foreign DNA sequences, such as TAs (Tartaglia et 

al., 1993 a,b; U.S. Pat. No. 5,833,975). ALVAC was deposited under the terms of the Budapest 
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Treaty with the American Type Culture Collection (ATCC), 10801 University Boulevard, 
Manassas Va. 201 10-2209, USA, ATCC accession number VR-2547. 

Another useful poxvirus vector is TROVAC. TROVAC refers to an attenuated fowlpox 
that was a plaque-cloned isolate derived from the FP-1 vaccine strain of fowlpoxvirus which is 
licensed for vaccination of 1 day old chicks. TROVAC was likewise deposited under the terms 
of the Budapest Treaty with the ATCC, accession number 2553. 

"Non-viral" plasmid vectors may also be suitable in practicing the present invention. 
Preferred plasmid vectors are compatible with bacterial, insect, and / or mammalian host cells. 
Such vectors include, for example, PCR-1I, pCR3, and pcDNA3.1 (Invitrogen, San Diego, CA), 
pBSII (Stratagene, La Jolla, CA), pET15 (Novagen, Madison, WI), pGEX (Pharmacia Biotech, 
Piscataway, NJ), pEGFP-N2 (Clontech, Palo Alto, CA), pETL (BlueBacII, Invitrogen), pDSR- 
alpha (PCT pub. No. WO 90/14363) and pFastBacDual (Gibco-BRL, Grand Island, NY) as well 
as Bluescript® plasmid derivatives (a high copy number COLEl-based phagemid, Stratagene 
Cloning Systems, La Jolla, CA), PCR cloning plasmids designed for cloning Taq-amplified PCR 
products (e.g., TOPO™ TA cloning® kit, PCR2.1® plasmid derivatives, Invitrogen, Carlsbad, 
CA). Bacterial vectors may also be used with the current invention. These vectors include, for 
example, Shigella, Salmonella, Vibrio cholerae, Lactobacillus, Bacille calmette guerin (BCG), 
and.Streptococcus (see for example, WO 88/6626; WO 90/0594; WO 91/13157; WO 92/1796; 
and WO 92/21376). Many other non-viral plasmid expression vectors and systems are known in 
20 the i art and could be used with the current invention. 

Suitable nucleic acid delivery techniques include DNA-ligand complexes, adenovirus- 
ligand-DNA complexes, direct injection of DNA, CaPO» precipitation, gene gun techniques, 
electroporation, and colloidal dispersion systems, among others. Colloidal dispersion systems 
include macromolecule complexes, nanocapsules, microspheres, beads, and lipid-based systems 
including oil-in-water emulsions, micelles, mixed micelles, and liposomes. The preferred 
colloidal system of this invention is a liposome, which are artificial membrane vesicles useful as 
delivery vehicles in vitro and in vivo. RNA, DNA and intact virions can be encapsulated within 
the aqueous interior and be delivered to cells in a biologically active form (Fraley, R., et al, 
198 1, Trends Biochem. Set, 6: 77). The composition of the liposome is usually a combination 
30 of phospholipids, particularly Mgh-pliase-transition-temperature phospholipids, usually in 
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combination with steroids, especially cholesterol. Other phospholipids or other lipids may also 
be used. The physical characteristics of liposomes depend on pH, ionic strength, and the 
presence of divalent cations. Examples of lipids useful in liposome production include 
phosphatidyl compounds, such as phosphatidylglycerol, phosphatidylcholine, 
phosphatidylserme, phosphatidylemanolarnine, sphingolipids, cerebrosides, and gangliosides. 
Particularly useful are diacylphosphatidylglycerols, where the lipid moiety contains from 14-18 
carbon atoms, particularly from 16-1 8 carbon atoms, and is saturated. Hlustrative phospholipids 
include egg phosphatidylcholine, dipalmitoylphosphatidylcholine and 
distearoylphosphatidylcholine. 

An immunogenic target may also be administered in combination with one or more 
adjuvants to boost the inuriUne response. Exemplary adjuvants are shown in Table n below: 

Table n 
Types of Immunologic Adjuvants 



Type of 
Adjuvant 


General Examples 


Specific Examples/References 


Gel-type 


Aluminum hydroxide/phosphate ("alum 
adjuvants' 1 ) 


(Aggerbeck and Heron, 1995) 


Calcium phosphate 


(Relweld, 1986) 


Microbial 


Muramyl dipeptide (MDP) 


(Chedid etal., 1986) 


Bacterial exotoxins 


Cholera toxin (CT), Kcoli labile toxin 
(LT)(Freytag and Clements, 1999) 


Endotoxin-based adjuvants 


Monophosphoryl lipid A (MPL) 
(Ulrich and Myers, 1995) 


Other bacterial 


CpG oligonucleotides (Corral and 
Petray, 2000), BCG sequences (Krieg, et 
al. Nature, 374:576), tetanus toxoid 
(Rice, et al. J. Immunol , 2001, 167: 
1558-1565) 


Particulate 


Biodegradable 
Polymer microspheres 


(Gupta etal, 1998) 


Immunostimulatory complexes 
ttSCOMs) 


(Morein and Bengtsson, 1 999) 


Liposomes 


(Wassefetal., 1994) 


Oil-emulsion 
and 

surfactant- 
based 
adjuvants 


Freund's incomplete adjuvant 


(Jensen etal., 1998) 


Microfluidized emulsions 


MF59(Ott etal., 1995) 


SAF (Allison and Byars, 1992) 
(Allison. 1999) 


Saponins 


QS-21 (KensiL 1996) 


Synthetic 


Muramyl peptide derivatives 


Murabutide (Lederer, 1986) 
Threony-MDP (Allison, 1997) 


Nonionic block copolymers 


L121 (Allison, 1999) 


Polyphosphazene (PCPP) 


(Payne etal., 1995) 
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Synthetic polynucleotides 



Thalidomide derivatives 



Poly A:U, Poly hC (Johnson, 1994) 



CC-4047/ACTJMDD (J. Immunol., 
168(10):4914-9) 



Administration of a composition of the present invention to a host may be accomplished 
using any of a variety of techniques known to those of skill in the art. The composition(s) may 
be processed in accordance with conventional methods of pharmacy to produce medicinal agents 
for administration to patients, including humans and other mammals (i.e., a "pharmaceutical 
composition") . The pharmaceutical composition is preferably made in the form of a dosage unit 
containing a given amount of DNA, viral vector particles, polypeptide or peptide, for example. A 
suitable daily dose for a human or other mammal may vary widely depending on the condition of 
the patient and other factors, but, once again, can be determined using routine methods. 

The pharmaceutical composition may be administered orally, parentally, by inhalation 
spray, rectally, iniranodally; or topically in dosage unit formulations containing conventional 
pharmaceutical^ acceptable carriers, adjuvants, and vehicles. The term "pharmaceutically 
acceptable carrier" or "physiologically acceptable carrier" as used herein refers to one or more 
formulation materials suitable for accomplishing or enhancing the delivery of . a nucleic acid, 
polypeptide, or peptide as a pharmaceutical composition. A "pharmaceutical composition" is a 
composition comprising a merapeutically effective amount of a nucleic acid or polypeptide. The 
terms "effective amount" and "therapeutically effective amount" each refer to the amount of a 
nucleic acid or polypeptide used to induce or enhance an effective immune response. It is 
preferred that compositions of the present invention provide for the induction or enhancement of 
an anti-tumor immune response in a host which protects Ihe host from the development of a 
tumor and / or allows the host to eliminate an existing tumor from the body. 

For oral administration, the pharmaceutical composition may be of any of several forms 
including, for example, a capsule, a tablet, a suspension, or liquid, among others. Liquids may 
be administered by injection as a composition with suitable carriers including saline, dextrose, or 
water. The term parenteral as used herein includes subcutaneous, intravenous, intramuscular, 
intrasternal, infusion; or intraperitoneal administration. Suppositories for rectal administration of 
me drug can be prepared by mixing the drug with a suitable non-irritating excipient such as 
cocoa butter and polyethylene glycols that are solid at ordinary temperatures but liquid at the 
rectal temperature. 
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The dosage regimen for immunizing a host or otherwise treating a disorder or a disease 
with a composition of this invention is based on a variety of factors, including the type of 
disease, the age, weight, sex, medical condition of the patient, the severity of the condition, the 
route of administration, and the particular compound employed. For example, a poxviral vector 
5 may be administered as a composition comprising 1 x 10 6 infectious particles per dose. Thus, 
the dosage regimen may vary widely, but can be determined routinely using standard methods. 

. A prime-boost regimen may also be utilized (WO 01/30382 Al) in which the targeted 
immunogen is initially administered in a priming step in one form followed by a boosting step in 
which the targeted immunogen is administered in another form. The form of the targeted 
10 immunogen in the priming and boosting steps are different. For instance, if the priming step 
utilized a nucleic acid, the boost may be administered as a peptide. SimilarlyrWhere a priming 
step utilized one type of recombinant virus (i.e., ALVAC), the boost step may utilize another 
type of virus (i.e., NYVAC). This prime-boost method of administration has been shown to 
induce strong immunological responses. Various combinations of forms are suitable in 
.15 practicing the present invention. 

While the compositions of the invention can be administered as the sole active 
pharmaceutical agent, they can also be used in combination with one or more other compositions 
or agents (i.e., other immunogenic targets, co-stimulatory molecules, adjuvants). When 
administered as a combination, the individual components can be formulated as separate 
20 compositions administered at the same time or different times, or the components can be 
combined as a single composition. 

Injectable preparations, such as sterile injectable aqueous or oleaginous suspensions, may 
be formulated according to known methods using suitable dispersing or wetting agents and 
suspending agents. The injectable preparation may also be a sterile injectable solution or 
25 suspension in a non-toxic parenterally acceptable diluent or solvent. Suitable vehicles and 
. solvents that may be employed are water, Ringer's solution, and isotonic sodium chloride 
solution, among others. For instance, a viral vector such as a poxvirus may be prepared in 0.4% 
NaCl. In addition, sterile, fixed oils are conventionally employed as a solvent or suspending 
medium. For this purpose, any bland fixed oil may be employed, including synthetic mono- or 
30 diglycerides. In addition, fatty acids such as oleic acid find use in the preparation of injectables. 
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For topical administration, a suitable topical dose of a composition may be administered 
one to four, and preferably two or three times daily. The dose may also be administered with 
intervening days during which no does is applied. Suitable compositions may comprise from 
0.001% to 10% w/w, for example, from 1% to* 2% by weight of the formulation, although it may 

5 comprise as much as 10% w/w, but preferably not more than 5% w/w, and more preferably from 
0.1% to 1% of the formulation. Formulations suitable for topical administration include liquid or 
semi-liquid preparations suitable for penetration through the skin (e.g., liniments, lotions, 
ointments, creams, or pastes) and drops suitable for administration to the eye, ear, or nose. 

The pharmaceutical compositions may also be prepared in a solid form (including 

10 granules, powders or. suppositories). The pharmaceutical compositions may be subjected to 
conventional pharmaceutical operations such as sterilization and/or may contain conventional 
adjuvants, such as preservatives, stabilizers, wetting agents, emulsifiers, buffers etc. Solid 
dosage forms for oral administration may include capsules, tablets, pills, powders,, and granules. 
In such solid dosage forms, the active compound may be admixed with at least one inert diluent 

15 such as sucrose, lactose, or starch. Such dosage forms may also comprise, as in normal practice, 
additional substances other than inert diluents, e.g., lubricating agents such as magnesium 
stearate, In the case of capsules, tablets, and pills, the dosage forms may also comprise buffering 
agents. Tablets and pills can additionally be prepared with enteric coatings. Liquid dosage 
forms for oral administration may include pharmaceutical^ acceptable emulsions, solutions, 

20 suspensions, syrups, and elixirs containing inert diluents commonly used in the art, such as 
water. Such compositions may also comprise adjuvants, such as wetting sweetening, flavoring, 
and perfuming agents. 

Pharmaceutical compositions comprising a nucleic acid or polypeptide of the present 
invention may take any of several forms and may be administered by any of several routes. In 

25 preferred embodiments, the compositions are administered via a parenteral route (intradermal, 
intramuscular or subcutaneous) to induce an immune response in the host. Alternatively, the 
composition may be administered directly into a lymph node (intranodal) or tumor mass (i.e., 
intratumoral administration). For example, the dose could be administered subcutaneously at 
days 0, 7, and 14. Suitable methods for immunization using compositions comprising TAs are 

30 known in the art, as shown for p53 (Hollstein et al„ 1991), p21-ras (Almoguera et al., 1988), 

HER-2 (Fendly et .al., 1990), the melanoma-associated antigens (MAGE-1; MAGE-2) (van der 
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Bruggen et al., 1991), p97 (Hu et al., 1988), melanoma-associated antigen E (WO 99/30737) and 
carcinoembryonic antigen (CEA) (Kantor et al., 1993; Fishbein et al., 1992; Kaufinan et al., 
1991), among others. 

Preferred embodiments of administratable compositions include, for example, nucleic 
acids or polypeptides in liquid preparations such as suspensions, syrups, or elixirs. Preferred 
injectable preparations include, for example, nucleic acids or polypeptides suitable for parental, 
subcutaneous, intradermal, intramuscular or intravenous administration such as sterile, 
suspensions or emulsions. For example, a recombinant poxvirus may be in admixture with a 
suitable carrier, diluent, or excipient such as sterile water, physiological saline, glucose or the 
like. The composition may also be provided in lyophilized form for reconstituting, for instance, 
in isotonic aqueous^ saline buffer. In addition, the compositions can be co-administered or 
sequentially administered with other antineoplastic, anti-tumor or anti-cancer agents and/or with 
agents which reduce or alleviate ill effects of antineoplastic, anti-tumor or anti-cancer agents. 

A kit comprising a composition of the present invention is also provided. The kit can 
include a separate container containing a suitable carrier, diluent or excipient The kit can also 
include an additional anti-cancer, anti-tumor or antineoplastic agent and/or an agent that reduces 
or alleviates ill effects of antineoplastic, anti-tumor or anti-cancer agents for co- or sequential- 
administration. Additionally, the kit can include instructions for mixing or combining ingredients 
and/or administration. 

A better understanding of the present invention and of its many advantages will be had 
from the following examples, given by way of illustration. 

EXAMPLES 
Example 1 

Construction of the Multi-Antigen Construct vT416 

The expression vector vT416 (ALVAC-NY-ESO-l/Tip-2-LFA-3/ICAM-l/B7.1- 

E3L/K3L) was constructed in the ALVAC vector using standard techniques. DNA sequences 

encoding NY-ESCM, Tip-2, LFA-3, ICAM-1, B7.1, wE3L and wK3L were inserted into 

various loci within the ALVAC genome. DNA sequences encoding NY-ESO-1 (Chen et al. 

1997 PNAS 94:1914) and TRP-2 (Wang et al. 1996 J. Exp. Med. 184:2207) were inserted into 

27 



WO 2005/026370 



PCT/US2004/028751 



the C5 locus. DNA sequences encoding LFA-3 (Wallner, et al. (1987) J. Exp. Med. 166:923- 
932), ICAM-1 (Staunton, et al. (1988) Cell 52:925-933) and B7.1 (Chen, et al. (1992) Cell 
71:1093-1102) were inserted into the C3 locus. LFA-3, ICAM-1 and B7.1 form an expression 
cassette known as TRICOM. DNA sequences encoding wE3L (Chang, et al. 1992: Proc. Natl. 
Acad. Sci. U. S. A 89:4825-4829) and wK3L (Beattie, et al. 1991. Virology 183:419-422) were 
inserted into the C6 locus. Promoters were utilized as follows: 



Tabic 111 



DNA sequence 


Promoter 


E3L 


vaccinia E3L 


K3L 


vaccinia H6 


LFA-3 


vaccinia 30K 


ICAM-1 


vaccinia 13 


B7.1 


sE/L 


NY-ESO-1 


vaccinia H6 


. TRP-2 


sE/L 



10 

Promoter sE/L is described by Chakrabarti, et al. (BioTechniques 23: 1094-1097, 1997). 
The donor plasmids utilized are shown below: 



Table IV 



Plasmid 


Size (bp) 


Vector 


Antibiotic 
Resitance Gene 


PMPC6H6K3E3 




pBS-SK 


Amp . 


pALVAC.Tricom(C3) #33 


10,470 


pBS-SK 


Amp 


pT1132 


11,154 


pBS-SK 


Amp 



NY-ESO-1 and TRP-2 DNA sequences were inserted into the ALVAC donor plasmid 
.20 pT1132. This donor plasmid was then used with pALVAC.Tricom(C3) #33 to generate the 
ALVAC-TRICOM recombinant expressing these genes using standard techniques. The plasmids 
pALVAC.Tricom(C3) #33 and pT1132 are shown in Figure 1. The DNA sequences of 
pALVAC.Tricom(C3) #33 and pTl 132 are shown in Figures 2 and 3, respectively. 
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Example 2 

Construction of the Multi-Antigen Construct vT419 
The expression vector vT4 19 (ALVAC-gplOOM/Mart-1/ Mage-1,3 minigene-LFA- 
3/ICAM-1/B7.1-E3L/K3L) was constructed in the ALVAC vector using standard techniques. 

5 DNA sequences encoding the gpl00M/MART4/MAGE-l,3 minigene, LFA-3, ICAM-1, B7.1, 
wE3L and wK3L were inserted into various loci within the ALVAC genome. The gplOOM/ 
MART- 1 /MAGE- 1 ,3 minigene was inserted into the C5 locus. DNA sequences encoding LFA-3 
(Wallner, et al. (1987) J. Exp. Med 166:923-932), ICAM-1 (Staunton, et al. (1988) Cell 
52:925-933) and B7.1 (Chen, et al. (1992) Cell 71:i093-1102) were inserted into the C3 locus. 

10 LFA-3, ICAM-1 and B7.1 form an expression cassette known as TRICOM. DNA sequences 
encoding wE3L (Chang, et al. 1992. ?roc. Natl. Acad. Sci. U. S. A 89:4825-4829) and wK3L 

* (Beattie, et al; 199L Virology 183:419-422) were inserted into the C6 locus. Promoters were 
utilized as follows: 



Table V 



Gene 


Promoter 


E3L 


vaccinia E3L 


K3L 


vaccinia H6 


LFA-3 


vaccinia 30K 


ICAM-1 


vaccinia 13 


B7.1 


sE/L 


gplOOfM) 


vaccinia H6 


Mart-1 


vaccinia 42K 



Promoter sE/L is described by Chakrabarti, et al. (BioTechniques 23: 1094-1097, 1997). 
20 The donor plasmids utilized are shown below: 

Table VI 



Plasmid 


Size (bp) 


Vector 


Antibiotic 
Resitance Gene 


PMPC6H6K3E3 




pBS-SK 


Amp 


pALVAC.Tricoin(C3) #33 


10,470 


pBS-SK 


Amp 


PT3217 


11,465 


pBS-SK 


Amp 
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gplOO(M), Mart-1 and Mage-1,3 minigene were inserted into the ALVAC C5 donor 
plasmid P T3217. This donor plasmid was then used with pALVAC.Tricom(C3) #33 to generate 
the ALVAC-TRICOM recombinant expressing these genes using standard techniques. This 
donor plasmid inserts into the C5 site. pALVAC.Tricom(C3) #33 is shown in Figures 1 and 2. 
5 The pT3217 plasmid is shown in Figure 4. The DNA sequence of pT32 17 is shown in Figure 5. 

EXAMPLE 3 
Immunological Assessment of Multi-Antigen Vectors 
The results of the first animal experiment indicated a trend toward higher immunological 
10 responses to three (Mart 1, NY-ESO-1 and gplOO) of the four antigens when the vaccine was 
given as two separate injections. However, these'-dlfferences were not statistically significant. In 
detail, HLA-A2/K b transgenic mice (5/group) were immunized subcutaneously with vT419 
(ALVAC(2>gpl00M/MART-l/MAGE-l/3 minigene/TRICOM) and vT416 (ALVAC(2)-TRP- 
2/NY-ESO-l/TRICOM) either combined at one site or given as separate injections. Control mice 
15 were immunized with parental ALVAC(2). Mice were vaccinated three times (at three week 
intervals), and three weeks after the last boost T cell responses in individual mice were analyzed 
by IFN-g ELISPOT and CTL assays following in vitro restimulation with peptide. Compared to 
control animals, mice vaccinated with the multi-antigen vectors (at 2 sites) exhibited statistically 
significant ELISPOT responses against MART-1 . The IFN-gamma response to gpl DOM and NY- 
. 20 ESO-1 were also detectable, although these responses were not statistically significant due to 
response variability and the small number of cultures tested. ELISPOT responses, against the 
TRP-2 antigen were elevated in all groups tested (including control animals), presumably due to 
the fact that the dominant A2-restricted TRP-2 peptide (180-188) cross-reacts with H-2K b and 
can induce low avidity T cell responses in naive mice following in vitro, culture, and were 
25 therefore not statistically significant. Interestingly, ELISPOT responses in mice injected with an 
admixture of vT416 and VT419 were generally lower than in mice receiving each virus 
separately, although these differences did not achieve statistical significance. The CTL data were 
largely negative, except for one strong anti-gplOO response and one marginal anti-MART-1 
response, both of which occurred in mice vaccinated with vT416 and vT419 (two sites). Overall, 
30 these results provided encouraging data that establish that the multi-antigen vectors can generate 
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responses against MART-1, and suggest that anti-gplOO and anti-NY-ESO-1 responses can also 
be induced. 

Two additional pre-clinical animal studies, have been completed using the melanoma 
multi-antigen ALVAC recombinants. In these experiments, HLA-A2/K b transgenic mice 

5 (5/gn>up) were immunized subcutaneously with vT419 (ALVAC(2)-gpl00M/MART-l/MAGE- 
1/3 minigenefTRICOM) and vT416 (ALVAC(2)-TRP-2/NY-ESO-l/niICOM) either combined 
at one site or given as separate injections. Control mice were immunized with parental 
ALVAC(2). After vaccination, the T cell responses in individual mice were assessed by IFN- 
gamma ELISPOT assay following in vitro restimulation with peptide. Unlike the previous miilti- 

10 antigen experiment, which provided encouraging immunogenicity data, the two most recent 
studies generated inconclusive data, due to high background responses in control immunized 
animals. Therefore, overall the results were deemed as inconclusive. 

To confirm the immunogenicity of the multi-antigen constructs, and to repeat results 
from the first study, another pre-clinical animal study has been completed. HLA-A2/K b 

15 transgenic mice (10/group) were immunized subcutaneously with vT419 (ALVAC(2)- 
gpl00MZMART-l/\lAGE-l/3 minigene/TRICOM) and vT416 (ALVAC(2)-TRP-2/NY-ESO- 
1/TRICOM) given as separate injections. Control mice were immunized with parental 
ALVAC(2). Statistically significant ELISPOT responses were detectable against gplOO, Mart-1 
and TRP-2, and some responses were detected against NY-ESO-1, which were at the border of 

20 being statistically significant. 

While the present invention has been- described in terms of the preferred embodiments, it 
is understood that variations and modifications will occur to those skilled in the art. Therefore, it 
is intended that the appended claims cover all such equivalent variations that come within the 
25 scope of the invention as claimed 
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CLAIMS 

What is claimed is: 

1. An expression vector for co-expressing at least two immunogenic targets, wherein said 
immunogenic targets are selected from the group consisting of NY-ESO-1, TRP-2, gplOO, 

5 gplOOM, a MART antigen, MART-1, a MAGE antigen, MAGE-1, and MAGE-3. 

2. The expression vector of claim 1 wherein the vector is a plasmid or a viral vector 

3. The expression vector of claim 2 wherein the viral vector is selected from the group, 
consisting of poxvirus, adenovirus, retrovirus, herpesvirus, and adeno-associated virus. 

4. The expression vector of claim 3 wherein the viral vector is a poxvirus selected from the 
10 group consisting of vaccinia, NYVAC, avipox, canarypox, ALVAC, ALVAC(2), fowlpox, 

andTROVAC. 

5. The expression vector of claim 4 wherein the viral vector is a poxvirus selected from the 
group consisting of NYVAC, ALVAC, and ALVAC(2). 

6. The expression vector of claim 1 further comprising at least one nucleic sequence encoding 
15 an angiogenesis-associated antigen. 

7. The expression vector of claim 6 wherein the vector is a plasmid or a viral vector. 

8. The expression vector of claim 7 wherein the viral vector is selected from the group 
consisting of poxvirus, adenovirus, retrovirus, herpesvirus, and adeno-associated virus. 

9. The expression vector of claim 8 wherein the viral vector is a poxvirus selected from the 
20 group consisting of vaccinia, NYVAC, avipox, canarypox, ALVAC, ALVAC(2), fowlpox, 

andTROVAC. 

10. The expression vector of claim 9 wherein the viral vector is a poxvirus selected from the ; 
group consisting of NYVAC, ALVAC, and ALVAC(2). 

11. the expression vector of claim 1 or 6 further comprising at least one nucleic acid sequence 
25 encoding a co-stimulatory component 

12. The expression vector of claim 1 1 wherein the vector is a plasmid or a viral vector. 

13. The expression vector of claim 12 wherein the viral vector is selected from the group 
consisting of poxvirus, adenovirus, retrovirus, herpesvirus, and adeno-associated virus. 

14. The expression vector of claim 13 wherein the viral vector is a poxvirus selected frbm the 
30 group consisting of vaccinia, NYVAC, avipox, canarypox, ALVAC, ALVAC(2), fowlpox, 

andTROVAC. 
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15. The expression vector of claim 14 wherein the viral vector is a poxvirus selected from the 
group consisting of NYVAC, ALVAC, and ALVAC(2). 

16. The expression vector of any one claims 11-15 wherein the co-stimulatory component is 
human B7.1. 

5 17. A composition comprising an expression vector of any one of claims 1-16 in a 
phailnaceutically acceptable carrier. 

18. A method for preventing or treating cancer comprising administering to a host an expression 
vector of any one of claims 1-16. 

19. A method for preventing or treating cancer comprising administering to a host a composition 
10 of claim 17. 
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FIGURE 2 • 
DNA Sequence off pALVACTricom(C3) #33 

1 GGAAATTGTA AACGTTAATA TTTTGTTAAA ATTCGCGTTA AATTTTTGTT 
.. CCTTTAACAT TTGCAATTAT AAAACAATTT TAAGCGCAAT TTAAAAACAA 
.5 51 . AAATCAGCTC ATTTTTTAAC CAATAGGCCG AAATCGGCAA AATCCCTTAT 

TTTAGTCGAG TAAAAAATTG GTTATCCGGC TTTAGCCGTT TTAGGGAATA 
101 AAATCAAAAG AATAGACCGA GATAGGGTTG AGTGTTGTTC CAGTTTGGAA 
TTTAGTTTTC TTATCTGGCT CTATCCCAAC TCACAACAAG GTCAAACCTT 
151 CAAGAGTCCA CTATTAAAGA ACGTGGACTC CAACGTCAAA GGGCGAAAAA 
10 GTTCTCAGGT GATAATTTCT TGCACCTGAG GTTGCAGTTT CCCGCTTTTT 

201 CCGTCTATCA GGGCGATGGC CCACTACGTG AACCATCACC CTAATCAAGT 
GGCAGATAGT CCCGCTACCG GGTGATGCAC TTGGTAGTGG GATTAGTTCA 
251 TTTTTGGGGT CGAGGTGCCG TAAAGCACTA AATCGGAACC CTAAAGGGAG 
.AAAAACCCCA GCTCCACGGC ATTTCGTGAT TTAGCCTTGG GATTTCCCTC 
15 301 CCCCCGATTT AGAGCTTGAC GGGGAAAGCC GGCGAACGTG GCGAGAAAGG 

GGGdGCTAAA tctcgaactg cccctttcgg ccgcttgcac cgctctttcc 

■351 AAGGGAAGAA AGCGAAAGGA GCGGGCGCTA GGGCGCTGGC AAGTGTAGCG 

TTCCCTTCTT TCGCTTTCCT CGCCCGCGAT CCCGCGACCG TTCACATCGC 

401 GTCACGCTGC GCGTAACCAC CACACCCGCC GCGCTTAATG CGCCGCTACA 

20 CAGTGCGACG CGCATTGGTG GTGTGGGCGG CGCGAATTAC GCGGCGATGT 

451 GGGCGCGTCG CGCCATTCGC CATTCAGGCT GCGCAACTGT TGGGAAGGGC 

CCCGCGCAGC GCGGTAAGCG GTAAGTCCGA CGCGTTGACA ACCCTTCCCG 

501. GATCGGTGCG GGCCTCTTCG CTATTACGCC AGCTGGCGAA AGGGGGATGT 

CTAGCCACGC ,CCGGAGAAGC GATAATGCGG TCGACCGCTT TCCCCCTACA 

25 .551 GCTGCAAGGC GATTAAGTTG GGTAACGCCA GGGTTTTCCC AGTCACGACG 

CGACGTTCCG CTAATTCAAC CCATTGCGGT CCCAAAAGGG TCAGTGCTGC 

601 TTGTAAAACG ACGGCCAGTG AATTGTAATA CGACTCACTA TAGGGCGAAT 

AACATTTTGC TGCCGGTCAC TTAACATTAT GCTGAGTGAT ATCCCGCTTA 

651 . TGGGTACCGC GGCCGCGTCG ACATGCATTG TTAGTTCTGT AGATCAGTAA 

30 ACCCATGGCG CCGGCGCAGC TGTACGTAAC AATCAAGACA TCTAGTCATT 



Left Arm 

701 CGTATAGCAT ACGAGTATAA TTATCGTAGG TAGTAGGTAT CCTAAAATAA 
GCATATCGTA TGCTCATATT AATAGCATCC ATCATCCATA GGATTTTATT 



35 


751 


ATCTGATACA 
■ TAGACTATGT 


Left Arm 
GATAATAACT TTGTAAATCA 
CTATTATTGA AACATTTAGT 


ATTCAGCAAT TTCTCTATTA 
TAAGTCGTTA AAGAGATAAT 


40 


801 


TCATGATAAT 
AGTACTATTA 


Left Arm 
GATTAATACA CAGCGTGTCG 
CTAATTATGT GTCGCACAGC 


TTATTTTTTG 
AATAAAAAAC 


TTACGATAGT 
AATGCTATCA 


45 


851 


ATTTCTAAAG 
TAAAGATTTC 


Left Arm 
TAAAGAGCAG GAATCCCTAG 
ATTTCTCGTC CTTAGGGATC 


TATAATAGAA 
ATATTATCTT 


ATAATCCATA 
TATTAGGTAT 


50 


901 


Left Arm 

TGAAAAATAT. AGTAATGTAC ATATTTCTAA 
ACTTTTTATA TCATTACATG TATAAAGATT 


TGTTAACATA 
ACAATTGTAT 


TTTATAGGTA 
AAATATCCAT 




951 . 


AATCCAGGAA 
TTAGGTCCTT 


Left Arm 
GGGTAATTTT TACATATCTA 
CCCATTAAAA ATGTATAGAT 


TATACGCTTA .TTACAGTTAT 
ATATGCGAAT AATGTCAATA 
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c 


1001 


Left Arm ' 
TAAAAATATA CTTGCAAACA- TGTTAGAAGT 
ATTTTTATAT GAACGTTTGT ACAATCTTCA 


AAAAAAGAAA 
TTTTTTCTTT 


GAACTAATTT 
CTTGATTAAA 


D 


1051 


TACAAAGTGC 
ATGTTTCACG 


Left Arm' 
TTTACCAAAA TGCCAATGGA 
AAATGGTTTT ACGGTTACCT 


AATTACTTAG 
TTAATGAATC 


TATGTATATA 
ATACATATAT 


10 


1101 


ATGTATAAAG 
TACATATTTC 


Left Arm 
GTATGAATAT CACAAACAGC 
CATACTTATA GTGTTTGTCG 


AAATCGGCTA 
TTTAGCCGAT 


TTCCCAAGTT 
AAGGGTTCAA 


15 






Left Arm 








' 1151 


GAGAAACGGT 
CTCTTTGCCA 


ATAATAGATA TATTTCTAGA 
TATTATCTAT ATAAAGATCT 


TACCATTAAT 
ATGGTAATTA 


AACCTTATAA 
TTGGAATATT 


20 . 


1201 


Left Arm 

GCTTGACGTT TCCTATAATG CCTACTAAGA 

CGAACTGCAA aggatattac ggatgattct 


AAACTAGAAG 
TTTGATCTTC 


ATACATACAT- 
TATGTATGTA 


25 


1251 


Left Arm 

ACTAACGCCA TACGAGAGTA actactcatc 
TGATTGCGGT atgctctcat TGATGAGTAG 


GTATAACTAC TGTTGCTAAC 
CATATTGATG ACAACGATTG 


30 


1301 


AGTGACACTG 
TCACTGTGAC 


Left Arm 
ATGTTATAAC TCATCTTTGA 
TACAATATTG AGTAGAAACT 


TGTGGTATAA 
ACACCATATT 


ATGTATAATA 
TACATATTAT 


35 


1351 


Left Arm 

ACTATATTAC ACTGGTATTT TATTTCAGTT ATATACTATA TAGTATTAAA 
' TGATATAATG TGACCATAAA ATAAAGTCAA TATATGATAT ATCATAATTT 


1401 


AATTATATTT 
TTAATATAAA 


Left Arm 

GTATAATTAT ATTATTATAT TCAGTGTAGA AAGTAAAATA 
CATATTAATA TAATAATATA AGTCACATCT TTCATTTTAT 


40 


1451 


CTATAAATAT 
GATATTTATA 


Left Arm 
GTATCTCTTA TTTATAACTT 
CATAGAGAAT AAATATTGAA 


ATTAGTAAAG 
TAATCATTTC 


TATGTACTAT 
ATACATGATA . 


45 


1501 


TCAGTTATAT 
AGTCAATATA 


Left Arm 
TGTTTTATAA AAGCTAAATG 
ACAAAATATT TTCGATTTAC 


CTACTAGATT 
GATGATCTAA 


GATATAAATG 
CTATATTTAC 


50 


1551 


AATATGTAAT 
TTATACATTA 


Left Arm 
AAATTAGTAA TGTAGTATAC 
TTTAATCATT ACATCATATG 


TAATATTAAC TCACATTTGA 
ATTATAATTG AGTGTAAACT 








Left Arm 




30K P 


55 


1601 


CTAATTAGCT 
GATTAATCGA 


ATAAAAACCC TAAGGTAGGC 
TATTTTTGGG ATTCCATCCG 


GGCCGCACTA 
CCGGCGTGAT 


GAGGATTCGA 
CTCCTAAGCT 
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30K Pr. . 



5 


1651 


CAAACACCAA TAATTCCCTT CTCTTCATTC CGGACATTAA ATTGGCTATA 
GTTTGTGGTT ATTAAGGGAA GAGAAGTAAG GCCTGTAATT TAACCGATAT 

30K'Pr. 


.10 


1701 


GATAATAAAG 
CTATTATTTC 


ACATTGAGAT GTTACAGGCT 
TGTAACTCTA CAATGTCCGA 
30K Pr. 


CTGTTCAAAT 
GACAAGTTTA 


ACGACATTAA 
TGCTGTAATT 


1751 


TATCTATTCT 
ATAGATAAGA 


GCTAATCTGG AAAATGTACT 
CGATTAGACC TTTTACATGA 
30K Pr. 


ATTGGATGAT 
TAACCTACTA 


GCCGAAATAG 
CGGCTTTATC 


15 


1801 


CTAAAATGAT 
GATTTTACTA 


TATAGAAAAG CATGTTGAAT 
ATATCTTTTC GTACAACTTA 
3 OK Pr. 


ACAAGTCTGA 
TGTTCAGACT 


CTCCTATACA 
GAGGATATGT 


20 


1851 


AAAGATCTCG 
TTTCTAGAGC 


ATATAGTCAA GAATAATAAA 
TATATCAGTT CTTATTATTT 
30K Pri 


TTGGATGAAA 
AACCTACTTT 


TAATTAGCAA 
ATTAATCGTT 


25 


1901 


AAACAAGGAA 
TTTGTTCCTT 


CTCAGACTCA TGTACGTCAA 
GAGTCTGAGT ACATGCAGTT 
30K Pr. 


TTGTGTAAAG AAAAACTAA'T 
AACACATTTC TTTTTGATTA 


30 




TAGATTCTCC 
ATCTAAGAGG 


CACATTTTTG TTAACATTAC 
GTGTAAAAAC AATTGTAATG 
30K Pr. 


ACTAACTAAT 
TGATTGATTA 


TGGTAAAATT 
ACCATTTTAA 


2001 - 


GATAGAATAA 
CTATCTTATT 


TTATGTGTAT ATAAGATAGA 
AATACACATA TATTCTATCT 
30K Pr. 


TTTCCTATTG 
AAAGGATAAC 


TCTTACTCAT 
AGAATGAGTA 


35 








hLFA-3 




2051 


TGCATCGTGG 
ACGTAGCACC 


GAATTGAGAT CAGCTTCCGC 
CTTAAGTCTA GTCGAAGGCG 


GGCATGGTTG 
CCGTACCAAC 


CTGGGAGCGA 
GACCCTCGCT 
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hLFA-3 



5 


2101 


CGCGGGGCGG 
GCGCCCCGCC 


GCCCTGGGGG TCCTCAGCGT GGTCTGCCTG CTGCACTGCT 
CGGGACCCCC AGGAGTCGCA CCAGACGGAC GACGTGACGA 
hLFA-3 


10 


2151 


TTGGTTTCAT 
AACCAAAGTA 


CAGCTGTTTT TCCCAACAAA TATATGGTGT TGTGTATGGG 
GTCGACAAAA AGGGTTGTTT ATATACCACA ACACATACCC 
hLFA-3 . 


2201 


. AATGTAACTT 
TTACATTGAA 


TCCATGTACC AAGCAATGTG CCTTTAAAAG AGGTCCTATG 
AGGTACATGG TTCGTTACAC GGAAATTTTC TCCAGGATAC 
hLFA-3 


15 


2251 * 


GAAAAAACAA 
CTTTTTTGTT 


AAGGATAAAG TTGCAGAACT GGAAAATTCT GAATTCAGAG 
TTCCTATTTC AACGTCTTGA CCTTTTAAGA CTTAAGTCTC 
hLFA-3 


20 - 


2301 


CTTTCTCATC 
GAAAGAGTAG 


TTTTAAAAAT AGGGTTTATT TAGACACTGT GTCAGGTAGC 
AAAATTTTTA TCCCAAATAA ATCTGTGACA CAGTCCATCG 
hLFA-3 


25 


2351 


CTCACTATCT 
GAGTGATAGA 


ACAACTTAAC ATCATCAGAT GAAGATGAGT ATGAAATGGA 
TGTTGAATTG TAGTAGTCTA CTTCTACTCA TACTTTACCT 
hLFA-3 


30 


2401 


ATCGCCAAAT 
TAGCGGTTTA 


ATTACTGATA CCATGAAGTT CTTTCTTTAT GTGfCTTGAGT 
TAATGACTAT GGTACTTCAA GAAAGAAATA CACGAACTCA 
hLFA-3 


2451 


CTCTTCCATC 
GAGAAGGTAG 


TCCCACACTA ACTTGTGCAT TGACTAATGG AAGCATTGAA 
AGGGTGTGAT TGAACACGTA ACTGATTACC TTCGTAACTT 
hLFA-3 


35 


2501 


GTCCAATGCA 
CAGGTTACGT 


TGATACCAGA GCATTACAAC AGCCATCGAG GACTTATAAT 
ACTATGGTCT CGTAATGTTG TCGGTAGCTC CTGAATATTA 
hLFA-3 


40 


2551 


GTACTCATGG GATTGTCCTA TGGAGCAATG TAAACGTAAC TCAACCAGTA. 
CATGAGTACC CTAACAGGAT ACCTCGTTAC ATTTGCATTG AGTTGGTCAT 

hLFA-3 


45 


ZbOl 


TATATTTTAA GATGGAAAAT GATCTTCCAC AAAAAATACA GTGTACTCTT 
ATATAAAATT CTACCTTTTA CTAGAAGGTG TTTTTTATGT CACATGAGAA ' 

hLFA-3 


50 


2651 


AGCAATCCAT 
TCGTTAGGTA 


TATTTAATAC AACATCATCA ATCATTTTGA CAACCTGTAT 
ATAAATTATG TTGTAGTAGT TAGTAAAACT GTTGGACATA 
hLFA-3 


2701 


CCCAAGCAGC 
GGGTTCGTCG 


GGTCATTCAA GACACAGATA TGCACTTATA CCCATACCAT 
CCAGTAAGTT CTGTGTCTAT ACGTGAATAT GGGTATGGTA 
. hLFA-3 



55 2751 TAGCAGTAAT TACAACATGT ATTGTGCTGT ATATGAATGG TATTCTGAAA 



ATCGTCATTA ATGTTGTACA TAACACGACA TATACTTACC ATAAGACTTT 
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hLFA-3 






13 Pr. 


2801 


TGTGACAGAA 
ACACTGTCTT 


AACCAGACAG 
TTGGTCTGTC 


•AACCAACTCC 
TTGGTTGAGG 
13 Pr. 


AATTGATTGG 
TTAACTAAPP 


CTCGACCGGG 


2851' 


AATGTACTAT 
TT AC AT GAT A 


CTACGTACGA 
GATGCATGCT 


AACCCGCATC 
TTGGGCGTAG 
13 Pr. 


CGCTCCCATT 
GCGAGGGTAA 


CAATTCACAT 
GTTAAGTGTA 


2901 


. TGGACAAGGA 
ACCTGTTCCT 


TAAAATAAAA 
ATTTTATTTT 


CCACTGGTGG 
GGTGACCACC 
13 Pr. 


TTTGCGATTC 
AAACGCTAAG 


CGAAATCTGT 
GCTTTAGACA 


2951 " 


ACATCATGCA 
TGTAGTACGT 


GTGGTTAAAC 
CACCAATTTG 


AAAAACATTT 
TTTTTGTAAA 
13 Pr. 


TTATTCTCAA ATGAGATAAA' 
AATAAGAGTT TACTCTATTT 


3001 


GTGAAAATAT 
CACTTTTATA 
13 Pr. 


ATATCATTAT 
TATAGTAATA 


ATTACAAAGT ACAATTATTT 
TAATGTTTCA TGTTAATAAA 
hICAM 


AGGTTTAATC 
TCCAAATTAG 


3051 


AATCCCGCGG 
TTAGGGCGCC 


GCTATGGCTC 
CGATACCGAG 


CCAGCAGCCC 
GGTCGTCGGG 
hi CAM 


CCGGCCCGCG 
GGCCGGGCGC 


CTGCCCGCAC 
GACGGGCGTG 


3101 


TCCTGGTCCT 
AGGACCAGGA 


GCTCGGGGCT 
CGAGCCCCGA 


CTGTTCCCAG 
GACAAGGGTC 
hi CAM 


GACCTGGCAA 
CTGGACCGTT 


TGCCCAGACA 
ACGGGTCTGT 


3151 


TCTGTGTCCC 
AGACACAGGG 


CCTCAAAAGT 
GGAGTTTTCA 


CATCCTGCCC 
GTAGGACGGG 
hICAM 


CGGGGAGGCT CCGTGCTGGT 
GCCCCTCCGA GGCACGACCA 



3201 " GACATGCAGC ACCTCCTGTG ACCAGCCCAA GTTGTTGGGC ATAGAGACCC 
CTGTACGTCG TGGAGGACAC TGGTCGGGTT CAACAACCCG TATCTCTGGG 

hi CAM 



3251 CGTTGCCTAA AAAGGAGTTG CTCCTGCCTG GGAACAACCG GAAGGTGTAT 
GCAACGGATT TTTCCTCAAC GAGGACGGAC CCTTGTTGGC CTTCCACATA 

hi CAM 

3301 GAACTGAGCA ATGTGCAAGA AGATAGCCAA CCAATGTGCT ATTCAAACTG 
CTTGACTCGT TACACGTTCT TCTATCGGTT GGTTACACGA TAAGTTTGAC ' 

hi CAM 



3351 CCCTGATGGG CAGTCAACAG CTAAAACCTT CCTCACCGTG TACTGGACTC 
GGGACTACCC GTCAGTTGTC GATTTTGGAA GGAGTGGCAC ATGACCTGAG 

hICAM 



3401 CAGAACGGGT GGAACTGGCA CCCCTCCCCT CTTGGCAGCC AGTGGGCAAG 
GTCTTGCCCA CCTTGACCGT GGGGAGGGGA GAACCGTCGG TCACCCGTTC 

. hICAM 

34-51 AACCTTACCC TACGCTGCCA GGTGGAGGGT GGGGCACCCC GGGCCAACCT 
TTGGAATGGG ATGCGACGGT CCACCTCCCA CCCCGTGGGG CCCGGTTGGA 
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hi CAM 



5 


3501 


CACCGTGGTG CTGCTCCGTG GGGAGAAGGA 
GTGGCACCAC GACGAGGCAC CCCTCTTCCT 

hICAM 


GCTGAAACGG 
CGACTTTGCC 


: GAGCCAGCTG 
CTCGGTCGAC 


10 


3551 


TGGGGGAGCC CGCTGAGGTC ACGACCACGG 
ACCCCCTCGG GCGACTCCAG TGCTGGTGCC 

hi CAM 


TGCTGGTGAG 
ACGACCACTC 


GAGAGATCAC 
CTCTCTAGTG 


3601 


CATGGAGCCA ATTTCTCGTG CCGCACTGAA 
GTACCTCGGT TAAAGAGCAC GGCGTGACTT 

hICAM 


CTGGACCTGC 
GACCTGGACG 


GGCCCCAAGG 
CCGGGGTTCC 


15 


3651 


GCTGGAGCTG TTTGAGAACA CCTCGGCCCC 
CGACCTCGAC AAACTCTTGT GGAGCCGGGG 

hICAM" 


CTACCAGCTC 
GATGGTCGAG 


CAGACCTTTG 
GTCTGGAAAC 


20 


3701 


TCCTGCCAGC GACTCCCCCA CAACTTGTCA 
AGGACGGTCG CTGAGGGGGT GTTGAACAGT 

hi CAM 


GCCCCCGGGT 
CGGGGGCCCA 


CCTAGAGGTG 
GGATCTCCAC 


.25 


3751 


GACACGCAGG GGACCGTGGT CTGTTCCCTG 
CTGTGCGTCC CCTGGCACCA GACAAGGGAC 

hICAM 


GACGGGCTGT 
CTGCCCGACA 


TCCCAGTCTC 
AGGGTCAGAG 


30 


3801 


GGAGGCCCAG GTCCACCTGG CACTGGGGGA 
CCTCCGGGTC CAGGTGGACC GTGACCCCCT 

hi CAM 


CCAGAGGTTG 
GGTCTCCAAC 


AACCCCACAG 
TTGGGGTGTC 


3851 


TCACCTATGG CAACGACTCC TTCTCGGCCA AGGCCTCAGT 
AGTGGATACC GTTGCTGAGG AAGAGCCGGT TCCGGAGTCA 

hICAM 


CAGTGTGACC 
GTCACACTGG 


35 


3901 


GCAGAGGACG AGGGCACCCA GCGGCTGACG 
CGTCTCCTGC TCCCGTGGGT CGCCGACTGC 

hICAM 


TGTGCAGTAA TACTGGGGAA 
ACACGTCATT ATGACCCCTT 


40 


3951 


CCAGAGCCAG GAGACACTGC AGACAGTGAC 
GGTCTCGGTC CTCTGTGACG TCTGTCACTG 

hICAM 


CATCTACAGC 
GTAGATGTCG 


TTTCCGGCGC 
AAAGGCCGCG 


45 




CCAACGTGAT TCTGACGAAG CCAGAGGTCT 
GGTTGCACTA AGACTGCTTC GGTCTCCAGA 

hi CAM 


CAGAAGGGAC 
GTCTTCCCTG 


CGAGGTGACA 
GCTCCACTGT 


50 


4051 


GTGAAGTGTG AGGCCCACCC TAGAGCCAAG GTGACGCTGA ATGGGGTTCC 
CACTTCACAC. TCCGGGTGGG ATCTCGGTTC CACTGCGACT TACCCCAAGG 

hICAM 


4101 


AGCCCAGCCA* CTGGGCCCGA GGGCCCAGCT 
TCGGGTCGGT GACCCGGGCT CCCGGGTCGA 

hICAM 


CCTGCTGAAG 
GGACGACTTC 


GCCACCCCAG " 
CGGTGGGGTC 



55 4151 AGGACAACGG GCGCAGCTTC TCCTGCTCTG CAACCCTGGA GGTGGCCGGC 
TCCTGTTGCC CGCGTCGAAG AGGACGAGAC GTTGGGACCT CCACCGGCCG 
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hICAM 



5 


4201 


CAGCTTATAC ACAAGAACCA GACCCGGGAG. CTTCGTGTCC TGTATGGCCC 
GTCGAATATG TGTTCTTGGT CTGGGCCCTC GAAGCACAGG ACATACCGGG 

hi CAM 


10 


4251 


CCGACTGGAC 
GGCTGACCTG 


GAGAGGGATT 
CTCTCCCTAA 


GTCCGGGAAA CTGGACGTGG CCAGAAAATT 
CAGGCCCTTT GACCTGCACC GGTCTTTTAA 
hICAM 


4301 


. CCCAGCAGAC 
GGGTCGTCTG 


TCCAATGTGC 
AGGTTACACG 


CAGGCTTGGG 
GTCCGAACCC 
hi CAM 


GGAACCCATT 
CCTTGGGTAA 


GCCCGAGCTC 
CGGGCTCGAG 


15 


4351' 


AAGTGTCTAA 
TTCACAGATT 


AGGATGGCAC 
TCCTACCGTG 


TTTCCCACTG 
AAAGGGTGAC 
hi CAM 


CCCATCGGGG 
GGGTAGCCCC 


AATCAGTGAC 
TTAGTCACTG 


20 


4401 


TGTCACTCGA 
ACAGTGAGCT 


GATCTTGAGG 
CTAGAACTCC 


GCACCTACCT 
CGTGGATGGA 
hi CAM 


CTGTCGGGCC AGGAGCACTC 
GACAGCCCGG TCCTCGTGAG 


25 


4451 


AAGGGGAGGT 
TTCCCCTCCA 


CACCCGCGAG 
GTGGGCGCTC 


GTGACCGTGA 
CACTGGCACT 
hi CAM 


ATGTGCTCTC 
TACACGAGAG 


CCCCCGGTAT 
GGGGGCCATA 


30 


4501 


GAGATTGTCA 
CTCTAACAGT 


TCATCACTGT 
AGTAGTGACA 


GGTAGCAGCC 
CCATCGTCGG 
hICAM 


GCAGTCATAA 
CGTCAGTATT 


TGGGCACTGC 
ACCCGTGACG 


4551 


AGGCCTCAGC 
' TCCGGAGTCG 


ACGTACCTCT 
TGCATGGAGA 


ATAACCGCCA 
TATTGGCGGT 
hi CAM 


GCGGAAGATC 
CGCCTTCTAG 


AAGAAATACA 
TTCTTTATGT 


35 


4601 ' 


GACTACAACA GGCCCAAAAA GGGACCCCCA TGAAACCGAA CACACAAGCC 
CTGATGTTGT CCGGGTTTTT CCCTGGGGGT ACTTTGGCTT GTGTGTTCGG 
hi CAM sE/L Pr. 


40 


4651 


ACGCCTCCCT GAGCATGCAT 
TGCGGAGGGA CTCGTACGTA 
SE/L Pr. 


GTAGCTTAAA 
CATCGAATTT 


AATTGAAATT 
TTAACTTTAA 


TTATT.TTTTT. 
AATAAAAAAA 


45 


4701 


TTTTTGGAAT 
AAAAACCTTA 


ATAAATAAGC 
TATTTATTCG 


TCGAAGTCGA 
AGCTTCAGCT 
hB7 . 1 


AATTCCTGCA 
TTAAGGACGT 


GCCCGGGGCC 
CGGGCCCCGG 


50 


4751 


ATGGGCCACA 
TACCCGGTGT 


CACGGAGGCA 
GTGCCTCCGT 


GGGAACATCA 
CCCTTGTAGT 
hB7.1 


CCATCCAAGT 
GGTAGGTTCA 


GTCCATACCT 
CAGGTATGGA 


4801 


CAATTTCTTT 
GTTAAAGAAA 


CAGCTCTTGG 
GTCGAGAACC 


TGCTGGCTGG 
ACGACCGACC 
hB7.1 


TCTTTCTCAC 
AGAAAGAGTG 


TTCTGTTCAG 
AAGACAAGTC 


55 


. 4851 


GTGTTATCCA 
CACAATAGGT 


CGTGACCAAG 
GCACTGGTTC 


GAAGTGAAAG AAGTGGCAAC 
CTTCACTTTC TTCACCGTTG 


GCTGTCCTGT 
CGACAGGACA 
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hB7.1 



5 


4901 


GGTCACAATG 
CCAGTGTTAC 


TTTCTGTTGA 
AAAGACAACT 


> AGAGCTGGCA 
TCTCGACCGT 
hB7.1 


CAAACTCGCA 
GTTTGAGCGT 


TCTACTGGCA 
AGATGACCGT 


10 


4951 


AAAGGAGAAG 
TTTCCTCTTC 


AAAATGGTGC 
TTTTACCACG 


TGACTATGAT 
ACTGATACTA 
hB7.1 


GTCTGGAGAC 
CAGACCTCTG 


ATGAATATAT 
TACTTATATA 


• 5001 


GGCCCGAGTA 
CCGGGCTCAT 


CAAGAACCGG 
GTTCTTGGCC 


ACCATCTTTG 
TGGTAGAAAG 
hB7.1 


ATATCACTAA 
TATAGTGATT 


TAACCTCTCC 
ATTGGAGAGG 


15 


5051 


ATTGTGATCC 
TAACACTAGG 


TGGCTCTGCG 
ACCGAGACGC 


CCCATCTGAC 
GGGTAGACTG 
hB7.1" 


GAGGGCACAT 
CTCCCGTGTA 


ACGAGTGTGT 
TGCTCACACA 


20 


5101 


TGTTCTGAAG 
ACAAGACTTC 


TATGAAAAAG 
ATACTTTTTC 


ACGCTTTCAA 
TGCGAAAGTT 
hB7.1 


GCGGGAACAC 
CGCCCTTGTG 


CTGGCTGAAG 
* GACCGACTTC 


25 


5151 


TGACGTTATC 
'ACTGCAATAG 


AGTCAAAGCT 
TCAGTTTCGA 


GACTTCCCTA 
CTGAAGGGAT 
hB7.1 


CACCTAGTAT 
GTGGATCATA 


ATCTGACTTT 
TAGACTGAAA 


30 


5201 


GAAATTCCAA 
CTTTAAGGTT 


CTTCTAATAT 
GAAGATTATA 


TAGAAGGATA 
ATCTTCCTAT 
hB7.1 


ATTTGCTCAA. 
TAAACGAGTT 


CCTCTGGAGG 
GGAGACCTCC 


5251 


TTTTCCAGAG 
AAAAGGTCTC 


CCTCACCTCT 
GGAGTGGAGA 


CCTGGTTGGA 
GGACCAACCT 
hB7.1 


AAATGGAGAA 
TTTACCTCTT 


GAATTAAATG 
CTTAATTTAC 


35 • 


5301 


CCATCAACAC 
GGTAGTTGTG 


AACAGTTTCC 
TTGTCAAAGG 


CAAGATCCTG 
GTTCTAGGAC 
hB7.1 


AAACTGAGCT 
TTTGACTCGA 


CTATGCTGTT 
GATACGACAA 


40 


535*1 


AGCAGCAAAC 
TCGTCGTTTG 


TGGATTTCAA 
ACCTAAAGTT 


TATGACAACC AACCACAGCT TCATGTGTCT 
ATACTGTTGG TTGGTGTCGA AGTACACAGA 
hB7.1 


45 




CATCAAGTAT 
GTAGTTCATA 


GGACATTTAA 
CCTGTAAATT 


GAGTGAATCA 
CTCACTTAGT 
hB7.1 


GACCTTCAAC 
CTGGAAGTTG 


TGGAATACAA 
ACCTTATGTT 


50 


54 51 


CCAAGCAAGA 
GGTTCGTTCT 


GCATTTTCCT 
CGTAAAAGGA 


GATAACCTGC TCCCATCCTG 
CTATTGGACG AGGGTAGGAC 
hB7.1. 


GGCCATTACC 
CCGGTAATGG 


5501 


TTAATCTCAG* TAAATGGAAT TTTCGTGATA TGCTGCCTGA 
AATTAGAGTC ATTTACCTTA AAAGCACTAT ACGACGGACT 

hB7 . 1 


CCTACTGCTT ' 
GGATGACGAA 



55 5551 - TGCCCCACGC TGCAGAGAGA GAAGGAGGAA TGAGAGATTG AGAAGGGAAA 
ACGGGGTGCG ACGTCTCTCT CTTCCTCCTT ACTCTCTAAC TCTTCCCTTT 
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hB7 . 1 

5601 GTGTACGCCC TGTATAAAAG CTTTCTAGGT TTTTGTTTAG GGCTGCAGGA 
CACATGCGGG ACATATTTTC GAAAGATCCA AAAACAAATC CCGACGTCCT 1 
5 5651 ATTCCTCGAG GGATCCCGAT TTTTATGACT AGTTAATCAA ATAAAAAGCA 
TAAGGAGCTC CCTAGGGCTA AAAATACTGA TCAATTAGTT TATTTTTCGT 

Right Arm 

57 01 TACAAGCTAT TGCTTCGCTA TCGTTACAAA ATGGCAGGAA TTTTGTGTAA 
ATGTTCGATA ACGAAGCGAT AGCAATGTTT TACCGTCCTT AAAACACATT 



10 


5751 


ACTAAGCCAC 
TGATTCGGTG 


Right Arm 
ATACTTGCCA ATGAAAAAAA 
TATGAACGGT TACTTTTTTT 


TAGTAGAAAG 
ATCATCTTTC 


GATACTATTT 
CTATGATAAA 


15 


5801 


TAATGGGATT 
ATTACCCTAA 


Right Arm 
AGATGTTAAG GTTCCTTGGG 
TCTACAATTC CAAGGAACCC 


ATTATAGTAA 
TAATATCATT 


CTGGGCATCT 
GACCCGTAGA 


20 


5851 


GTTAACTTTT 
CAATTGAAAA 


Right Arm 
ACGACGTTAG GTTAGATACT 
TGCTGCAATC CAATCTATGA 


GATGTTACAG 
CTACAATGTC 


ATTATAATAA 
TAATATTATT- 


25 


5901 


Right Arm 
TGTTACAATA AAATACATGA CAGGATGTGA 
ACAATGTTAT TTTATGTACT GTCCTACACT 


TATTTTTCCT 
ATAAAAAGGA 


CATATAACTC 
GTATATTGAG 


30 


5951 


TTGGAATAGC 
AACCTTATCG 


Right. Arm 
AAATATGGAT CAATGTGATA 
TTTATACCTA GTTACACTAT 


GATTTGAAAA 
CTAAACTTTT 


TTTCAAAAAG 
AAAGTTTTTC 


6001 


CAAATAACTG 
GTTTATTGAC 


Right " Arm 
ATCAAGATTT ACAGACTATT 
TAGTTCTAAA TGTCTGATAA 


TCTATAGTCT 
AGATATCAGA 


GTAAAGAAGA 
CATTTCTTCT 


35 


6051 


GATGTGTTTT 
CTACACAAAA 


Right Arm 
CCTCAGAGTA ACGCCTCTAA 
GGAGTCTCAT TGCGGAGATT 


ACAGTTGGGA 
TGTCAACCCT 


GCGAAAGGAT 
CGCTTTCCTA 


40 


6101 


GCGCTGTAGT 
. • CGCGACATCA 


Right Arm 
TATGAAACTG GAGGTATCTG 
ATACTTTGAC CTCCATAGAC 


ATGAACTTAG 
TACTTGAATC 


AGCCCTAAGA 
TCGGGATTCT 


45 


6151 


AATGTTCTGC 
TTACAAGACG 


Right Arm 
TGAATGCGGT ACCCTGTTCG 
ACTTACGCCA TGGGACAAGC 


AAGGACGTGT 
TTCCTGCACA 


TTGGTGATAT 
AACCACTATA 


50 


6201 


CACAGTAGAT 
GTGTCATCTA 


Right Arm 
AATCCGTGGA ATCCTCACAT 
TTAGGCACCT TAGGAGTGTA 


AACAGTAGGA 
TTGTCATCCT 


TATGTTAAGG 
ATACAATTCC 


6251 


AGGACGATGT 
TCCTGCTACA 


Right Arm 
CGAAAACAAG AAACGCCTAA 
GCTTTTGTTC TTTGCGGATT 


TGGAGTGCAT 
ACCTCACGTA 


GTCCAAGTTT 
CAGGTTCAAA 


55 






Right Arm 
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6301 


AGGGGGCAAG 
TCCCCCGTTC 


AAATACAAGT TCTAGGATGG 
TTTATGTTCA AGATCCTACC 


TATTAATAAG 
ATAATTATTC 


TATCTAAGTA 
ATAGATTCAT 


5 • 


6351 


TTTGGTATAA 
AAACCATATT 


Right Arm 
TTTATTAAAT AGTATAATTA 
AAATAATTTA TCATATTAAT 


TAACAAATAA 
ATTGTTTATT 


TAAATAACAT 
ATTTATTGTA 


10 


6401 


GATAACGGTT 
• CTATTGCCAA 


Right Arm 
TTTATTAGAA TAAAATAGAG 
AAATAATCTT ATTTTATCTC 


ATAATATCAT 
TATTATAGTA 


AATGATATAT 
TTACTATATA 


15 


6451 


AATACTTCAT- 
TTATGAAGTA 


Right. Arm 
TACCAGAAAT GAGTAATGGA 
ATGGTCTTTA CTCATTACCT 


AGACTTATAA 
TCTGAATATT 


ATGAACTGCA 
TACTTGACGT 


6501 


TAAAGCTATA 
ATTTCGATAT 


Right Arm 
AGGTATAGAG ATATAAATTT AGTAAGGTAT 
TCCATATCTC TATATTTAAA TCATTCCATA 


ATACTTAAAA 
TATGAATTTT 


20 


6551 


AATGCAAATA 
TTACGTTTAT 


Right Arm 
CAATAACGTA AATATACTAT CAACGTCTTT 
GTTATTGCAT TTATATGATA GTTGCAGAAA 


GTATTTAGCC 
CATAAATCGG 


25 


6601 


GTAAGTATTT 
CATTCATAAA 


Right .Arm 
CTGATATAGA AATGGTAAAA 
GACTATATCT TTACCATTTT 


TTATTACTAG 
AATAATGATC 


AACACGGTGC 
TTGTGCCACG 


30 


6651 


CGATATTTTA 
. GCTATAAAAT 


Right Arm 
AAATGTAAAA ATCCTCCTCT 
TTTACATTTT TAGGAGGAGA 


TCATAAAGCT 
AGTATTTCGA 


GCTAGTTTAG 
CGATCAAATC 


35 


6701 


ATAATACAGA 
TATTATGTCT 


Right Arm 
AATTGCTAAA CTACTAATAG ATTCTGGCGC 
TTAACGATTT GATGATTATC TAAGACCGCG 


TGACATAGAA 
ACTGTATCTT 


6751. 


CAGATACATT 
GTCTATGTAA 


Right Arm 
CTGGAAATAG TCCGTTATAT 
GACCTTTATC AGGCAATATA 


ATTTCTGTAT 
TAAAGACATA 


ATAGAAACAA 
TATCTTTGTT 


40 


6801 


TAAGTCATTA 
ATTCAGTAAT 


Right Arm 
ACTAGATATT TATTAAAAAA AGGTGTTAAT 
TGATCTATAA ATAATTTTTT TCCACAATTA 


TGTAATAGAT 
ACATTATCTA 






TCTTTCTAAA 
AGAAAGATTT 


Right Arm 
TTATTACGAT GTACTGTATG 
AATAATGCTA CATGACATAC 


ATAAGATATC 
TATTCTATAG 


TGATGATATG 
ACTACTATAC 


50 


6901 


TATAAAATAT 
ATATTTTATA 


Right Arm 
TTATAGATTT TAATATTGAT 
AATATCTAAA ATTATAACTA 


CTTAATATAC 
GAATTATATG 


AAACTAGAAA 
TTTGATCTTT 


55 


6951 


TTTTGAAACT 
AAAACTTTGA 


Right Arm 
CCGTTACATT ACGCTATAAA 
GGCAATGTAA TGCGATATTT 


GTATAAGAAT 
CATATT.CTTA 


ATAGATTTAA 
TATCTAAATT 



Right Arm 
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7001 TTAGGATATT GTTAGATAAT AGTATTAAAA TAGATAAAAG TTTATTTTTG 
AATCCTATAA CAATCTATTA TCATAATTTT ATCTATTTTC AAATAAAAAC 



5 


7051 


CATAAACAGT 
GTATTTGTCA 


Right Arm 

ATCTCATAAA GGCACTTAAA AATAATTGTA GTTACGATAT 
TAGAGTATTT CCGTGAATTT- TTATTAACAT CAATGCTATA 


10 


7101 


AATAGCGTTA 
• TTATCGCAAT 


Right Arm 

CTTATAAATC ACGGAGTGCC TATAAACGAA CAAGATGATT 
GAATATTTAG TGCCTCACGG ATATTTGCTT GTTCTACTAA 


15 


7151 


Righ.t Arm 

TAGGTAAAAC- CCCATTACAT CATTCGGTAA- TTAATAGAAG AAAAGATGTA 
ATCCATTTTG GGGTAATGTA GTAAGCCATT AATTATCTTC TTTTCTACAT 


7201 


Right Arm 

ACAGCACTTC TGTTAAATCT AGGAGCTGAT ATAAACGTAA TAGATGACTG 
TGTCGTGAAG ACAATTTAGA TCCTCGACTA TATTTGCATX ATCTACTGAC 


20 


■•. 7251 


Right Arm 

TATGGGCAGT CCCTTACATT ACGCTGTTTC ACGTAACGAT ATCGAAACAA 
ATACCCGTCA GGGAATGTAA TGCGACAAAG TGCATTGCTA TAGCTTTGTT 


.25 


7301 


CAAAGACACT 
GTTTCTGTGA 


Right .Arm 

TTTAGAAAGA GGATCTAATG TTAATGTGGT TAATAATCAT 
AAATCTTTCT CCTAGATTAC AATTACACCA ATTATTAGTA 


30 


7351 


Right Arm 

ATAGATACCG TTCTAAATAT AGCTGTTGCA TCTAAAAACA AAACTATAGT 
. TATCTATGGC AAGATTTATA TCGACAACGT AGATTTTTGT TTTGATATCA 


35 


7401 


AAACTTATTA 
TTTGAATAAT 


Right Arm 

CTGAAGTACG GTACTGATAC AAAGTTGGTA. GGATTAGATA 
GACTTCATGC CATGACTATG TTTCAACCAT CCTAATCTAT 


7451 


AACATGTTAT 
TTGTACAATA 


Right Arm 

TCACATAGCT ATAGAAATGA AAGATATTAA TATACTGAAT 
AGTGTATCGA TATCTTTACT TTCTATAATT ATATGAGTTA 


40 


7501 


GCGATCTTAT 
CGCTAGAATA 


Right Arm 

TATATGGTTG CTATGTAAAC GTCTATAATC ATAAAGGTTT 
ATATACCAAC GATACATTTG CAGATATTAG TATTTCCAAA 


45 


. 7551' 


. CACTCCTCTA 
GTGAGGAGAT 


Right Arm 

TACATGGCAG TTAGTTCTAT GAAAACAGAA TTTGTTAAAC 
ATGTACCGTC AATCAAGATA CTTTTGTCTT AAACAATTTG 


50 


7601 


TCTTACTTGA 
AGAATGAACT 


Right Arm " 

CCACGGTGCT TACGTAAATG CTAAAGCTAA GTTATCTGGA 
GGTGCCACGA ATGCATTTAC GATTTCGATT CAATAGACCT 


55 


7651 


Right Arm 

AATAQTCCTT TACATAAAGC TATGTTATCT AATAGTTTTA ATAATATAAA 
TTATGAGGAA ATGTATTTCG ATACAATAGA TTATCAAAAT TATTATATTT 
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lid i 


ATTACTTTTA 
TAATGAAAAT 


Right Arm 
TCTTATAACG CCGACTATAA 
AGAATATTGC GGCTGATATT 


TTCTCTAAAT 
AAGAGATTTA 


AATCACGGTA 
TTAGTGCCAT 


5 


1 1 t^i 


Right Arm 

ATACQCCTCT AACTTGTGTT AGCTTTTTAG ATGACAAGAT AGCTATTATG 
TATGCGGAGA TTGAACACAA TCGAAAAATC TACTGTTCTA TCGATAATAC 


10 


i q m 


■ ATAATATCTA 
TATTATAGAT 


Right Arm 
AAATGATGTT AGAAATATCT 
TTTACTACAA TCTTTATAGA 


AAAAATCCTG 
TTTTTAGGAC 


AAATAGCTAA 
TTTATCGATT 


15 


7QC1 

fOOl 


TTCAGAAGGT 
AA'GTCTTCCA 


Right Arm 
TTTATAGTAA ACATGGAACA 
AAATATCATT TGTACCTTGT 


TATAAACAGT 
ATATTTGTCA 


AATAAAAGAC 
TTATTTTCTG 


20 


/ yui 


TACTATCTAT 
ATGATAGATA 


Right .Arm 
AAAAGAATCA TGCGAAAAAG 
TTTTCTTAGT ACGCTT.TTTC 


AACTAGATGT 
TTGATCTACA 


TATAACACAT 
ATATTGTGTA 


, 7951 


ATAAAGTTAA 
TATTTCAATT 


Right Arm 
ATTCTATATA TTCTTTTAAT 
TAAGATATAT AAGAAAATTA 


ATCTTTCTTG 
TAGAAAGAAC 


ACAATAACAT 
TGTTATTGTA 


25 


8001 


AGATCTTATG 
TCTAGAATAC 


Right Arm 
GTAAAGTTCG TAACTAATCC 
CATTTCAAGC ATTGATTAGG 


TAGAGTTAAT 
ATCTCAATTA 


AAGATACCTG 
TTCTATGGAC 


30 




CATGTATACG 
GTACATATGC 


Right Arm 
TATATATAGG- GAATTAATAC 
ATATATATCC CTTAATTATG 


qGAAAAATAA ATCATTAGCT 
CCTTTTTATT TAGTAATCGA 


35 




TTTCATAGAC 
AAAGTATCTG 


Right Arm 
ATCAGCTAAT AGTTAAAGCT 
TAGTCGATTA TCAATTTCGA 


GTAAAAGAGA 
CATTTTCTCT 


GTAAGAATCT 
CATTCTTAGA 


40 


8151 


AGGAATAATA 
TCCTTATTAT 


Right Arm 
GGTAGGTTAC CTATAGATAT 
CCATCCAATG GATATCTATA 


CAAACATATA 
GTTTGTATAT 


ATAATGGAAC 
TATTACCTTG. 


8201 


TATTAAGTAA 
ATAATTCATT 


Right Arm 
TAATGATTTA CATTCTGTTA 
ATTACTA7^AT GTAAGACAAT 


TCACCAGCTG 
AGTGGTCGAC 


TTGTAACCCA . 
AACATTGGGT 


45 


8251 


GTAGTATAAA 
CATCATATTT 


Right Arm 
GAGCTCCAGC TTTTGTTCCC 
CTCGAGGTCG AAAACAAGGG 


TTTAGTGAGG 
AAATCACTCC 


GTTAATTCCG 
CAATTAAGGC 



Right Arm 

50 8301 AGCTTGGCGT AATCATGGTC * ATAGCTGTTT CCTGTGTGAA ATTGTTATCC 
TCGAACCGCA TTAGTACCAG TATCGACAAA GGACACACTT TAACAATAGG 
8351 GCTCACAATT CCACACAACA TACGAGCCGG AAGCATAAAG TGTAAAGCCT 
CGAGTGTTAA GGTGTGTTGT . ATGCTCGGCC TTCGTATTTC ACATTTCGGA 
8401 GGGGTGCCTA ATGAGTGAGC TAACTCACAT TAATTGCGTT GCGCTCACTG 
55 CCCCACGGAT TACTCACTCG ATTGAGTGTA ATTAACGCAA CGCGAGTGAC 

8451 CCCGCTTTCC AGTCGGGAAA CCTGTCGTGC CAGCTGCATT AATGAATCGG 
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GGGCGAAAGG TCAGCCCTTT GGACAGCACG GTCGACGTAA TTACTTAGCC 
8501 CCAACGCGCG GGGAGAGGCG GTTTGCGTAT TGGGCGCTCT TCCGCTTCCT 
GGTTGCGCGC CCCTCTCCGC CAAACGCATA ACCCGCGAGA AGGCGAAGGA 
8551 CGCTCACTGA CTCGCTGCGC TCGGTCGTTC GGCTGCGGCG AGCGGTATCA 
GCGAGTGACT GAGCGACGCG AGCCAGCAAG CCGACGCCGC TCGCCATAGT 
8601 ' GCTCACTCAA AGGCGGTAAT ACGGTTATCC ACAGAATCAG GGGATAACGC 
CGAGTGAGTT TCCGCCATTA TGCCAATAGG TGTCTTAGTC CCCTATTGCG 
8651 AGGAAAGAAC ATGTGAGCAA AAGGCCAGCA AAAGGCCAGG AACCGTAAAA 
TCCTTTCTTG TACACTCGTT TTCCGGTCGT TTTCCGGTCC TTGGCATTTT 
8701 • AGGCCGCGTT GCTGGCGTTT TTCCATAGGC TCCGCCCCCC TGACGAGCAT 
TCCGGCGCAA CGACCGCAAA AAGGTATCCG AGGCGGGGGG ACTGCTCGTA 
8751 CACAAAAATC GACGCTCAAG TQAGAGGTGG CGAAACCCGA CAGGACTATA 
GTGTTTTTAG CTGCGAGTTC AGTCTCCACC GCTTTGGGCT GTCCTGATAT 
8801 AAGATACCAG GCGTTTCCCC CTGGAAGCTC CCTCGTGCGC TCTCCTGTTC 
TTCTATGGTC CGCAAAGGGG GACCTTCGAG GGAGCACGCG AGAGGACAAG 
8851 CGACCCTGCC GCTTACCGGA TACCTGTCCG CCTTTCTCCC TTCGGGAAGC 
GCTGGGACGG CGAATGGCCT ATGGACAGGC GGAAAGAGGG AAGCCCTTCG 
8901 ' GTGGCGCTTT CTCATAGCTC ACGCTGTAGG TATCTCAGTT CGGTGTAGGT 
. CACCGCGAAA GAGTATCGAG TGCGACATCC ATAGAGTCAA GCCACATCCA 
8951 CGTTCGCTCC AAGCTGGGCT GTGTGCACGA ACCCCCCGTT CAGCCCGACC 
GCAAGCGAGG TTCGACCCGA CACACGTGCT TGGGGGGCAA GTCGGGCTGG 
9001 GCTGCGCCTT ATCQGGTAAC TATCGTCTTG AGTCCAACCC GGTAAGACAC 
CGACGCGGAA TAGGCCATTG ATAGCAGAAC TCAGGTTGGG CCATTCTGTG 
9051 GACTTATCGC CACTGGCAGC AGCCACTGGT AACAGGATTA GCAGAGCGAG 
CTGAATAGCG GTGACCGTCG TCGGTGACCA TTGTCCTAAT CGTCTCGCTC 
9101 ■ GTATGTAGGC GGTGCTACAG AGTTCTTGAA "GTGGTGGCCT AACTACGGCT 
CATACATCCG CCACGATGTC TCAAGAACTT CACCACCGGA. TTGATGCCGA 
9.151 ACACTAGAAG GACAGTATTT GGTATCTGCG CTCTGCTGAA GCCAGTTACC 
TGTGATCTTC CTGTCATAAA CCATAGACGC GAGACGACTT CGGTCAATGG 
9201 TTCGGAAAAA GAGTTGGTAG CTCTTGATCC GGCAAACAAA CCACCGCTGG 
AAGCCTTTTT CTCAACCATC GAGAACTAGG CCGTTTGTTT GGTGGCGACC 
9251 TAGCGGTGGT TTTTTTGTTT GCAAGCAGCA GATTACGCGC, AGAAAAAAAG 
ATCGCCACCA AAAAAACAAA CGTTCGTCGT CTAATGCGCG TCTTTTTTTC 
9301 GATCTCAAGA AGATCCTTTG ATCTTTTCTA CGGGGTCTGA CGCTCAGTGG 
CTAGAQTTCT TCTAGGAAAC TAGAAAAGAT GCCCCAGACT GCGAGTCACC ' 
93 5i AACGAAAACT CACGTTAAGG GATTTTGGTC, ATGAGATTAT CAAAAAGGAT 
TTGCTTTTGA GTGCAATTCC CTAAAACCAG TACTCTAATA GTTTTTCCTA 
9401 CTTCACCTAG ATCCTTTTAA ATTAAAAATG AAGTTTTAAA TCAATCTAAA 
. GAAGTGGATC TAGGAAAATT TAATTTTTAC TTCAAAATTT AGTTAGATTT 
9451 GTATATATGA GTAAACTTGG TCTGACAGTT ACCAATGCTT AATCAGTGAG 
CATATATACT CATTTGAACC AGACTGTCAA- TGGTTACGAA TTAGTCACTC 
9501 GCACCTATCT CAGCGATCTG TCTATTTCGT T CATC CAT AG TTGCCTGACT 
CGTGGATAGA GTCGCTAGAC AGATAAAGCA AGTAGGTATC AACGGACTGA 
9551 CCCCGTCGTG TAGATAACTA CGATACGGGA GGGCTTACCA TCTGGCCCCA 
. GGGGCAGCAC ATCTATTGAT GCTATGCCCT CCCGAATGGT AGACCGGGGT 
9601 GTGCTGCAAT GATACCGCGA GACCCACGCT CACCGGCTCC AGATTTATCA 
CACGACGTTA CTATGGCGCT CTGGGTGCGA GTGGCCGAGG TCTAAATAGT 
9651 GCAATAAACC AGCCAGCCGG AAGGGCCGAG CGCAGAAGTG GTCCTGCAAC 
CGTTATTTGG TCGGTCGGCC TTCCCGGCTC GCGTCTTCAC CAGGACGTTG 
9701 TTTATCCGCC TCCATCCAGT CTATTAATTG TTGCCGGGAA GCTAGAGTAA 

AAATAGGCGG' AGGTAGGTCA GATAATTAAC AACGGCCCTT CGATCTCATT ' 
9751 GTAGTTCGCC AGTTAATAGT TTGCGCAACG TTGTTGCCAT TGCTACAGGC 
CATCAAGCGG TCAATTATCA AACGCGTTGC AACAACGGTA ACGATGTCCG 
9801 ATCGTGGTGT CACGCTCGTC GTTTGGTATG GCTTCATTCA GCTCCGGTTC 
' TAGCACCACA GTGCGAGCAG CAAACCATAC CGAAGTAAGT CGAGGCCAAG 
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9851 CCAACGATCA AGGCGAGTTA CATGATCCCC 
• GGTTGCTAGT TCCGCTCAAT GTACTAGGGG 
9901 TTAGCTCCTT CGGTCCTCCG • ATCGTTGTCA 
AATCGAGGAA GCCAGGAGGC TAGCAACAGT 
5 9951 TTATCACTCA TGGTTATGGC AGCACTGCAT 
AATAGTGAGT ACCAATACCG TCGTGACGTA 
10001 ATCCGTAAGA TGCTTTTCTG TGACTGGTGA 
TAGGCATTCT ACGAAAAGAC ACTGACCACT 
10051 GAGAATAGTG TATGCGGCGA CCGAGTTGCT 
10 ' CTCTTATCAC ATACGCCGCT GGCTCAACGA 

10101 . GATAATACCG CGCCACATAG CAGAACTTTA 
CTATTATGGC GCGGTGTATC GTCTTGAAAT 
10151 ACGTTCTTCG GGGCGAAAAC TCTCAAGGAT 
TGCAAGAAGC CCCGCTTTTG AGAGTTCCTA 
15 10201 GTTCGATGTA ACCCACTCGT GCACCCAACT 
CAAGCTACAT TGGGTGAGCA CGTGGGTTGA 
10251 TTCACCAGCG TTTCTGGGTG AGCAAAAACA 
AAGTGGTCGC AAAGACCCAC TCGTTTTTGT 
10301 AAAGGGAATA AGGGCGACAC GGAAATGTTG 
20 . TTTCCCTTAT TCCCGCTGTG CCTTTACAAC 

10351 TTCAATATTA TTGAAGCATT TATCAGGGTT 
AAGTTATAAT AACTTCGTAA ATAGTCCGAA 
' 10401 ATATTTGAAT GTATTTAGAA AAATAAACAA 
TATAAACTTA CATAAATCTT TTTATTTGTT 
25 10451 TCCCCGAAAA GTGCCACCTGAGGGGCTI 



CATGTTGTGC AAAAAAGCGG 
GTACAACACG T'fTTTTCGCC 
GAAGTAAGTT GGCCGCAGTG 
CTTCATTCAA CCGGCGTCAC 
AATTCTCTTA CTGTCATGCC 
TTAAGAGAAT GACAGTACGG 
GTACTCAACC AAGTCATTCT 
CATGAGTTGG TTCAGTAAGA 
CTTGCCCGGC GTCAATACGG 
GAACGGGCCG CAGTTATGCC 
AAAGTGCTCA TCATTGGAAA 
TTTCACGAGT AGTAACCTTT 
CTTACCGCTG TTGAGATCCA 
GAATGGCGAC AACTCTAGGT 
GATCTTCAGC ATCTTTTACT 
CTAGAAGTCG TAGAAAATGA 
GGAAGGCAAA ATGCCGCAAA 
CCTTCCGTTT TACGGCGTTT 
AATACTCATA CTCtTCCTTT 
TTATGAGTAT GAGAAGGAAA 
ATTGTCTCAT GAGCGGATAC 
TAACAGAGTA CTCGCCTATG 
ATAGGGGTTC CGCGCACATT 
TATCCCCAAG GCGCGTGTAA 
TT C ACGGTGGAC 
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FIGURE 3 ; Donor plasmid pi 132 

C5 Right Arm 



5 - 1 .TGAATGTTAA ATGTTATACT TTGGATGAAG GTATAAATAT GCATTGGAAA 

ACTTACAATT TACAATATGA AACCTACTTC GATATTTATA CGTAACCTTT 
C5 Right Arm 

51 AATAATCCAT TTAAAGAAAG GATTCAAATA CTACAAAACC TAAGCGATAA 
10 ' TTATTAGGTA AATTTCTTTC CTAAGTTTAT GATGTTTTGG ATTCGCTATT 

C5 Right Arm 

101 TATGTTAACT' AAGCTTATTC TTAACGACGC TTTAAATATA CACAAATAAA 
ATACAATTGA TTCGAATAAG AATTGCTGQG AAATTTATAT GTGTTTATTT 
15 C5 Right Arm 

151 CATAATTTTT GTATAACCTA ACAAATAACT AAAACATAAA AATAATAAAA 
. GTATTAAAAA CATATTGGAT TGTTTATTGA TTTTGTATTT TTATTATTTT 
C5 Right -Arm 

20 — — — _ 

201 GGAAATGTAA TATCGTAATT ATTTTACTCA GGAATGGGGT TAAATATTTA 
CCTTTACATT ATAGCATTAA TAAAATGAGT CCTTACCCCA ATTTATAAAT 
C5 Right Arm 

25 " 25l" TATCACGTGT ATATCTATAC TGTTATCGTA TACTCTTTAC AATTACTATT 
ATAGTGCACA ITATAGATATG ACAATAGCAT ATGAGAAATG TTAATGATAA 
C5 Right Arm 

301 ACGAATATGC AAGAGATAAT AAGATTACGT ATTTAAGAGA ATCTTGTCAT 
. 30 TGCTTATACG TTCTCTATTA TTCTAATGCA TAAATTCTCT TAGAACAGTA 

C5 Right Arm 

* 351 GATAATTGGG TACGACATAG TGATAAATGC TATTTCGCAT CGTTACATAA 
. CTATTAACCC ATGCTGTATC ACTATTTACG ATAAAGCGTA GCAATGTATT ' 
35 C5 Right Arm 

401 AGTCAGTTGG AAAGATGGAT TTGACAGATG TAACTTAATA GGTGCAAAAA 
TCAGTCAACC TTTCTACCTA AACTGTCTAC ATTGAATTAT CCACGTTTTT 
C5 Right Arm 

40 _ 

451 TGTTAAATAA CAGCATTCTA TCGGAAGATA GGATACCAGT TATATTATAC 
ACAATTTATT GTCGTAAGAT AGCCTTCTAT CCTATGGTCA ATATAATATG 
C5 Right Arm 

45 501 AAAAATCACT GGTTGGATAA AACAGATTCT GCAATATTCG TAAAAGATGA 

TTTTTAGTGA CCAACCTATT TTGTCTAAGA CGTTATAAGC ATTTTCTACT 
C5 Right Arm 

551 AGATTACTGC GAATTTGTAA ACTATGACAA TAAAAAGCCA TTTATCTCAA 
50 TCTAATGACG CTTAAACATT TGATACTGTT ATTTTTCGGT AAATAGAGTT 

C5 Right Arm 

601 CGACATCGTG TAATTCTTCC ATGTTTTATG TATGTGTTTC AGATATTATG 
GCTGTAGCAC ATTAAGAAGG TACAAAATAC ATACACAAAG TCTATAATAC 



55 
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C5 Right Arm 



651 AGATTACTAT AAACTTTTTG TATACTTATA TTCCGTAAAC TATATTAATC ' 
TCTAATGATA TTTGAAAAAC ATATGAATAT AAGGCATTTG ATATAATTAG 
C5 Right Arm 

701 ATGAAGAAAA TGAAAAAGTA TAGAAGCTGT TCACGAGCGG TTGTTGAAAA 
TACTTCTTTT ACTTTTTCAT ATCTTCGACA AGTGCTCGCC AACAACTTTT 
C5 Right Arm 



751 CAACAAAATT ATACATTCAA GATGGCTTAC ATATAGGTCT GTGAGGCTAT 
■ GTTGTTTTAA TATGTAAGTT CTACCGAATG TATATGCAGA CACTCCGATA 

C5 Right Arm 



801 CATGGATAAT GACAATGCAT CTCTAAATAG GTTTTTGGAC AATGGATTCG 
GTACCTATTA CTGTTACGTA GAGATTTATC CAAAAACCTG TTACCTAAGC 
C5 Right Arm 



851 ACCCTAACAC GGAATATGGT ACTCTACAAT CTCCTCTTGA AATGGCTGTA 
TGGGATTGTG CCTTATACCA TGAGATGTTA GAGGAGAACT TTACCGACAT- 
C5 Right Arm 



901 ATGTTCAAGA ATACCGAGGC TATAAAAATC TTGATGAGGT ATGGAGCTAA 
TACAAGTTCT TATGGCTCCG' ATATTTTTAG AACTACTCCA TACCTCGATT 
C5 Right Arm 



951 ACCTGTAGTT ACTGAATGCA CAACTTCTTG TCTGCATGAT GCGGTGTTGA 
TGGACATCAA* TGACTTACGT GTTGAAGAAC AGACGTACTA CGCCACAACT 
" C5 Right Arm 



1001 GAGACGACTA CAAAATAGTG AAAGATCTGT TGAAGAATAA CTATGTAAAC 
CTCTGCTGAT GTTTTATCAC TTTCTAGACA ACTTCTTATT GATACATTTG 
C5 Right Arm 

1051 AATGTTCTTT ACAGCGGAGG CTTTACTCCT TTGTGTTTGG CAGCTTACCT 
TTACAAGAAA TGTCGCCTCC GAAATGAGGA AACACAAACC GTCGAATGGA 
C5 Right Arm 

1101 TAACAAAGTT AATTTGGTTA AACTTCTATT GGCTCATTCG GCGGATGTAG 
• ATTGTTTCAA TTAAACCAAT TTGAAGATAA CCGAGTAAGC CGCCTACATC 

C5 Right Arm 



1151 ATATTTCAAA CACGGATCGG TTAACTCCTC TACATATAGC CGTATCAAAT ' 
TATAAAGTTT GTGCCTAGCC AATTGAGGAG ATGTATATCG GCATAGTTTA 
C5 Right Arm 

1201 AAAAATTTAA CAATGGTTAA ACTTCTATTG AACAAAGGTG CTGATACTGA 
TTTTTAAATT GTTACCAATT TGAAGATAAC TTGTTTCCAC GACTATGACT 
. C5- Right Arm 

1251 " CTTGCTGGAT AACATGGGAT GTACTCCTTT AATGATCGCT GTACAATCTG 
GAACGACCTA TTGTACCCTA. CATGAGGAAA TTACTAGCGA CATGTTAGAC 
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C5 Right Arm 



10 



20 



25 



30 



1301 



1351 



1401 



15 1451 



1501 



1551 



40 



45 



50 



55 1951 



GAAATAT"fGA AATATGTAGC ACACTACTTA AAAAAAATAA AATGTCCAGA 
CTTTATAACT TTATACATCG TGTGATGAAT TTTTTTTATT TTACAGGTCT ' 
C5 Right Arm 

ACTGGGAAAA ATTGATCTTG CCAGCTGTAA TTCATGGTAG AAAAGAAGTC3 
TGACCCTTTT TAACTAGAAC GGTCGACATT AAGTACCATC TTTTCTTCAC 
C5 Right Arm 

CTCAGGCTAC TTTTCAACAA AGGAGCAGAT GTAAACTACA TCTTTGAAAG ' 
GAGTCCGATG AAAAGTTGTT TCCTCGTCTA CATTTGATGT AGAAACTTTC 
C5 Right Arm 

AAATGGAAAA TCATATACTG TTTTGGAATT GATTAAAGAA AGTTACTCTG 
TTTACCTTTT AGTATATGAC AAAACCTTAA CTAATTTCTT TCAATGAGAC * 
C5 Right Arm 

AGACACAAAA GAGGTAGCTG AAGTGGTACT CTCAAAGGTA CGTGACTAAT 
TCTGTGTTTT CTCCATCGAC TTCACCATGA GAGTTTCCAT GCACTGATTA 

Repeat Region 



TAGCTATAAA AAGGATCGGC CGCTCTAGAA CTAGTGGATC GGGTTCTTTA 
ATCGATATTT TTCCTAGCCG GCGAGATCTT GATCACCTAG CCCAAGAAAT 
Repeat Region 



1601 


TTCTATACTT AAAAAGTGAA AATAAATACA AAGGTTCTTG AGGGTTGTGT 
AAGATATGAA TTTTTCACTT TTATTTATGT TTCCAAGAAC TCCCAACACA 
Repeat Region 


1651 


TAAATTGAAA 
ATTTAACTTT 


GCGAGAAATA ATCATAAATT 
CGCTCTTTAT TAGTATTTAA 
Repeat Region 


ATTTCATTAT 
TAAAGTAATA 


CGCGATATCC 
GCGCTATAGG 


1701 


GTTAAGTTTG 
CAATTCAAAC 


TATCGTACCC CGATCCCCCG AGCCATGCAG 
ATAGCATGGG GCTAGGGGGC TCGGTACGTC 
Repeat Region 


GCCGAAGGCC 
CGGCTTCCGG 


1751 


GGGGCACAGG 
CCCCGTGTCC 


GGGTTCGACG GGCGATGCTG ATGGCCCAGG AGGCCCTGGC. 
CCCAAGCTGC CCGCTACGAC TACCGGGTCC TCCGGGACCG 
■ Repeat Region ! 


1801 


ATTCCTGATG 
TAAGGACTAC 


GCCCAGGGGG CAATGCTGGC 
CGGGTCCCCC GTTACGACCG 
Repeat Region 


GGCCCAGGAG 
CCGGGTCCTC 


AGGCGGGTGC 
TCCGCCCACG 


1851 


CACGGGCGGC 
GTGCCCGCCG 


AGAGGTCCCC GGGGCGCAGG 
TCTCCAGGGG CCCCGCGTCC 
Repeat Region 


GGCAGCAAGG 
CCGTCGTTCC 


GCCTCGGGGC 
CGGAGCCCCG 


1901 


CGGGAGGAGG 
GCCCTCCTCC 


CGCCCCGCGG GGTCCGCATG 
GCGGGGCGCC CCAGGCGTAC 
Repeat Region 


GCGGCGCGGC 
CGCCGCGCCG 


TTCAGGGCTG 
AAGTCCCGAC 



AATGGATGCT GCAGATGCGG GGCCAGGGGG CCGGAGAGCC GCCTGCTTGA 
TTACCTACGA CGTCTACGCC CCGGTCCCCC GGCCTCTCGG CGGACGAACT 
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Repeat Region 



2001 GTTCTACCTC GCCATGCCTT TCGCGACACC CATAGCTTGA TATCGAATTC 
CAAGATGGAG CGGTACGGAA AGCGCTGTGG GTATCGAACT ATAGCTTAAG 
5 C1B promoter 



10 


2051 


TAGGGGGATC 
ATCCCCCTAG 


CACTAGTTCT AGAGGATCAT 
GTGATCAAGA TCTCCTAGTA 
C1B promoter 


TATTTAACGT 
ATAAATTGCA 


AAACTAAATG 
TTTGATTTAC 


2101 


GAAAAGCTAT 
CTTTTCGATA 


TTACAGGTAC ATACGGTGTT TTTCTGGAAT 
AATGTCCATG TATGCCACAA AAAGACCTTA 
C1B promoter 


CAAATGATTC 
GTTTACTAAG 


15 


2151 


TGATTTTGAG 
ACTAAAACTC 


GATTTTATCA ATACAATAAT 
CTAAAATAGT TATGTTATTA 
C1B promoter 


GACAGTGCTA ACTGGTAAAA 
CTGTCACGAT TGACCATTTT 


20 


2201 


AAGAAAGCAA 
TTCTTTCGTT 


ACAATTATCA TGGCTAACAA TTT.TTATTAT ATTTGTAGTA 
TGTTAATAGT ACCGATTGTT AAAAATAATA TAAACATCAT 
C1B promoter 


25 


2251 


TGCATAGTGG TCTTTACGTT TCTTTATTTA AAGTTAATGT 
ACGTATCACC AGAAATGCAA AGAAATAAAT TTCAATTACA 
C1B promoter LacZ 


GTTAAGATTA 
CAATTCTAAT 


30 


2301 


AATGGAGTAA 
TTACCTCATT 


TTGGATCCCC 
AACCTAGGGG 


CATCGATGGG 
GTAGCTACCC 
LacZ 


GAATTCACTG 
CTTAAGTGAC 


GCCGTCGTTT 
CGGCAGCAAA 


2351 


TACAACGTCG 
ATGTTGCAGC 


TGACTGGGAA AACCCTGGCG 
ACTGACCCTT TTGGGACCGC 
LacZ 


TTACCCAACT TAATCGCCTT 
AATGGGTTGA ATTAGCGGAA 


35 


2401 


GCAGCACATC 
CGTCGTGTAG 


CCCCTTTCGC 
GGGGAAAGCG 


CAGCTGGCGT 
GTCGACCGCA 
LacZ 


AATAGCGAAG 
TTATCGCTTC 


AGGCCCGCAC 
TCCGGGCGTG 


40 


2451 


CGATCGCCCT 
GCTAGCGGGA 


TCCCAACAGT 
AGGGTTGTCA 


TGCGCAGCCT 
ACGCGTCGGA 
LacZ 


GAATGGCGAA 
CTTACCGCTT 


TGGCGCTTTG 
ACCGC€AAAC 


45 


2501 


CCTGGTTTCC 
GGACCAAAGG 


GGCACCAGAA 
CCGTGGTCTT 


GCGGTGCCGG 
CGCCACGGCC 
LacZ 


AAAGCTGGCT 
TTTCGACCGA 


GGAGTGCGAT 
CCTCACGCTA 


50 


2551 


CTTCCTGAGG 
GAAGGACTCC 


CCGATACTGT 
GGCTATGACA 


CGTCGTCCCC 
GCAGCAGGGG 
LacZ 


TCAAACTGGC 
AGTTTGACCG 


AGATGCACGG 
TCTACGTGCC 


2601 


TTACGATGCG 
AATGCTACGC 


CCCATCTACA CCAACGTGAC 
GGGTAGATGT GGTTGCACTG 
LacZ 


CTATCCCATT ACGGTCAATC 
GATAGGGTAA TGCCAGTTAG 



55 2651 CGCCGTTTGT TCCCACGGAG AATCCGACGG GTTGTTACTC GCTCACATTT 



GCGGCAAACA AGGGTGCCTC TTAGGCTGCC CAACAATGAG CGAGTGTAAA 
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LacZ 



5 


2701 


AATGTTGATG AAAGCTGGCT ACAGGAAGGC CAGACGCGAA TTATTTTTGA 
TTACAACTAC TTTCGACCGA TGTCCTTCCG GTCTGCGCTT AATAAAAACT 

LacZ 


10 


2751 ' 


TGGCGTTAAC 
ACCGCAATTG 


TCGGCGTTTC ATCTGTGGTG CAACGGGCGC 
AGCCGCAAAG TAGACACCAC GTTGCCCGCG 
LacZ 


TGGGTCGGTT 
ACCCAGCCAA 


2801 


. ACGGCCAGGA 
TGCCGGTCCT 


CAGTCGTTTG CCGTCTGAAT TTGACCTGAG CGCATTTTTA 
GTCAGCAAAC GGCAGACTTA AACTGGACTC GCGTAAAAAT 
LacZ 


15 


2851 


CGCGCCGGAG 
GCGCGGCCTC 


AAAACCGCCT CGCGGTGATG GTGCTGCGCT 
TTTTGGCGGA GCGCCACTAC CACGACGCGA 
LacZ 


GGAGTGACGG 
CCTCACTGCC 


20 


2901 


CAGTTATCTG 
GTCAATAGAC 


QAAGATCAGG ATATGTGGCG GATGAGCGGC ATTtTCCGTG 
CTTCTAGTCC TATACACCGC CTACTCGCCG TAAAAGGCAC 
LacZ 


25 


' 2951 


ACGTCTCGTT 
TGCAGAGCAA 


GCTGCATAAA CCGACTACAC AAATCAGCGA TTTCCATGTT 
CGACGTATTT GGCTGATGTG TTTAGTCGCT AAAGGTACAA 
LacZ 


30 


3001 


GCCACTCGCT 
CGQTGAGCGA 


TTAATGATGA TTTCAGCCGC GCTGTACTGG 
AATTACTACT AAAGTCGGCG CGACATGACC 
LacZ 


AGGCTGAAGT 
TCCGACTTCA 


3051 


TCAGATGTGC 
AGTCTACACG 


GGCGAGTTGC GTGACTACCT ACGGGTAACA 
CCGCTCAACG CACTGATGGA TGCCCATTGT 
LacZ 


GTTTCTTTAT 
CAAAGAAATA 


35 


3101 


* GGCAGGGTGA 
CCGTCCCACT 


AACGCAGGTC GCCAGCGGCA CCGCGCCTTT 
TTGCGTCCAG CGGTCGCCGT GGCGCGGAAA 
LacZ 


CGGCGGTGAA 
GCCGCCACTT 


40 


3151 


ATTATCGATG 
TAATAGCTAC 


AGCGTGGTGG TTATGCCGAT CGCGTCACAC TACGTCTGAA*. 
TCGCACCACC AATACGGCTA GCGCAGTGTG ATGCAGACTT 
LacZ 


45 


3201 


~CGTCGAAAAC 
GCAGCTTTTG 


CCGAAACTGT GGAGCGCCGA AATCCCGAAT 
GGCTTTGACA CCTCGCGGCT TTAGGGCTTA 
LacZ 


CTCTATCGTG 
GAGATAGCAC 


50 


3251 


CGGTGGTTGA 
GCCACCAACT 


ACTGCACACC GCCGACGGCA CGCTGATTGA 
TGACGTGTGG CGGCTGCCGT GCGACTAACT 
LacZ 


AGCAGAAGCC 
TCGTCTTCGG 


3301 


TGCGATGTCG 
ACGCTACAGC 


GTTTCCGCGA GGTGCGGATT GAAAATGGTC TGCTGCTGCT 
CAAAGGCGCT CCACGCCTAA CTTTTACCAG ACGACGACGA 
LacZ 



55 ' 3351 GAACGGCAAG CCGTTGCTGA TTCGAGGCGT TAACCGTCAC GAGCATCATC 



CTTGCCGTTC GGCAACGACT AAGCTCCGCA ATTGGCAGTG CTCGTAGTAG 



20/53 



WO 2005/026370 



PCT/US2004/028751 



LacZ 



5 


3401 


CTCTGCATGG TCAGGTCATG 
GAGAGGTACC AGTCCAGTAC 


GATGAGCAGA 
CTACTCGTCT 
LacZ 


CGATGGTGCA 
GCTACCACGT 


GGATATCCTG 
CCTATAGGAC 


.10 


3451 


CTGATGAAGC AGAACAACTT 
GACTACTTCG TCTTGTTGAA 


TAACGCCGTG 
ATTGCGGCAC 
LacZ 


CGCTGTTCGC 
GCGACAAGCG 


ATTATCCGAA 
TAATAGGCTT 


3501 


CCATCCGCTG 
GGTAGGCGAC 


TGGTACACGC 
ACCATGTGCG 


TGTGCGACCG 
ACACGCTGGC 
LacZ 


CTACGGCCTG 
GATGCCGGAC 


TATGTGGTGG 
ATACACCACC 


15 


3551 


ATGAAGCCAA 
TACTTCGGTT 


TATTGAAACC 
ATAACTTTGG 


CACGGCATGG 
GTGCCGTACC 
LacZ 


TGCCAATGAA 
ACGGTTACTT 


TCGTCTGACC 
AGCAGACTGG 


20 


3601; 


GATGATCCGC 
CTACTAGGCG 


GCTGGCTACC 
CGACCGATGG 


GGCGATGAGC 
CCGCTACTCG 
LacZ 


GAACGCGTAA 
CTTGCGCATT 


CGCGAATGGT 
' GCGCTTACCA 


25 


3651 


GCAGCGCGAT 
CGTCGCGCTA 


CGTAATCACC 
GCATTAGTGG 


CGAGTGTGAT 
GCTCACACTA 
LacZ 


CATCTGGTCG 
GTAGACCAGC 


CTCGGGAATG 
GACCCCTTAC 


30 


. 3701 


AATCAGGCCA 
TTAGTCCGGT 


CGGCGCTAAT 
GCCGCGATTA 


CACGACGCGC 
GTGCTGCGCG 
LacZ 


TGTATCGCTG 
ACATAGCGAC 


GATCAAATCT 
CTAGTTTAGA 


3751 


■ GTCGATCCTT 
CAGCTAGGAA 


CCCGCCCGGT 
GGGCGGGCCA 


GCAGTATGAA 
CGTCATACTT 
LacZ 


GGCGGCGGAG 
CCGCCGCCTC 


CCGACACCAC 
GGCTGTGGTG 


35 ■ 


3801 


GGCCACCGAT 
CCGGTGGCTA 


ATTATTTGCC 
TAATAAACGG 


CGATGTACGC GCGCGTGGAT 
GCTACATGCG CGCGCACCTA 
LacZ 


GAAGACCAGC 
CTTCTGGTCG 


40 


" 3851 


. CCTTCCCGGC 
GGAAGGGCCG 


TGTGCCGAAA 
ACACGGCTTT 


TGGTCCATCA AAAAATGGCT TTCGCTACCT 
ACCAGGTAGT TTTTTACCGA AAGCGATGGA 
LacZ 


45 


3901- 


GGAGAGACGC 
CCTCTCTGCG 


GCCCGCTGAT 
CGGGCGACTA 


CCTTTGCGAA 
GGAAACGCTT 
LacZ 


TACGCCCACG 
ATGCGGGTGC 


CGATGGGTAA 
GCTACCCATT 


50 


3951 


CAGTCTTGGC 
GTCAGAACCG . 


GGTTTCGCTA AATACTGGCA GGCGTTTCGT 
CCAAAGCGAT TTATGACCGT CCGCAAAGCA 
LacZ 


CAGTATCCCC 
GTCATAGGGG 


4001 


GTT.TACAGGG ' 
CAAATGTCCC 


CGGCTTCGTC 
GCCGAAGCAG 


TGGGACTGGG TGGATCAGTC 
ACCCTGACCC ACCTAGTCAG 
LacZ 


GCTGATTAAA 
CGACTAATTT 



55 4051 ' TATGATGAAA ACGGCAACCC GTGGTCGGCT TACGGCGGTG ATTTTGGCGA 



ATACTACTTT TGCCGTTGGG CACCAGCCGA ATGCCGCCAC TAAAACCGCT 
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LacZ 



5 


4101 


TACGCCGAAC 
ATGCGGCTTG 


GATCGCCAGT TCTGTATGAA 
CTAGCGGTCA AGACATACTT 
LacZ 


. CGGTCTGGTC 
GCCAGACCAG 


TTTGCCGACC 
AAACGGCTGG 


10 


4151 


GCACGCCGCA 
CGTGCGGCGT 


TCCAGCGCTG AGGGAAGCAA 
AGGTCGCGAC TGCCTTCGTT 
LacZ 


AACACCAGCA 
TTGTGGTCGT 


GCAGTTTTTC 
CGTCAAAAAG 


4201 


CAGTTCCGTT 
GTCAAGGCAA 


TATCCGGGCA AACCATCGAA 
ATAGGCCCGT TTGGTAGCTT 
LacZ 


GTGACCAGCG 
CACTGGTCGC 


AATACCTGTT 
TTATGGACAA 


15 


4251 ' 


CCGTCATAGC 
GGCAGTATCG 


GATAACGAGC 
CTATTGCTCG 


TCCTGCACTG 
AGGACGTGAC 
LacZ 


GATGGTG GCG 
CTACCACCGC 


CTGGATGGTA 
GACCTACCAT 


20 


4301 * 


AGCCGCTGGC 
TCGGCGACCG 


AAGCGGTGAA 
TTCGCCACTT 


GTGCCTCTGG ATGTCGCTCC 
CACGGAGACC TACAGCGAGG 
LacZ 


ACAAGGTAAA 
TGTTCCATTT 


25 


4351 


CAGTTGATTG 
GTCAACTAAC 


AACTGCCTGA 
TTGACGGACT 


ACTACCGCAG 
TGATGGCGTC 
.LacZ 


CCGGAGAGCG. 
GGCCTCTCGC 


CCGGGCAACT * 
GGCCCGTTGA 


30 


4401 


CTGGCTCACA 
GACCGAGTGT 


GTACGCGTAG 
CATGCGCATC 


TGCAACCGAA CGCGACCGCA 
ACGTTGGCTT GCGCTGGCGT 
LacZ 


TGGTCAGAAG 
ACCAGTCTTC 


4451 


CCGGGCACAT 
GGCCCGTGTA 


CAGCGCCTGG 
GTCGCGGACC 


CAGCAGTGGC 
GTCGTCACCG 
LacZ 


GTCTGGCGGA 
CAGACCGCCT 


AAACCTCAGT 
TTTGGAGTCA 


35 


4501 


' GTGACGCTCC 
CACTGCGAGG 


CCGCCGCGTC 
GGCGGCGCAG 


CCACGCCATC 
GGTGCGGTAG 
LacZ 


CCGCATCTGA 
GGCGTAGACT 


CCACCAGCGA 
GGTGGTCGCT 


40 


•4551 


AATGGATTTT 
TTACCTAAAA 


TGCATCGAGC 
ACGTAGCTCG 


TGGGTAATAA 
ACCCATTATT 
LacZ 


GCGTTGGCAA 
CGCAACCGTT 


TTTAACCGCC . 
AAATTGGCGG 


45 


4601 


AGTCAGGCTT 
TCAGTCCGAA 


TCTTTCACAG 
AGAAAGTGTC 


ATGTGGATTG 
TACACCTAAC 
LacZ 


GCGATAAAAA 
CGCTATTTTT 


ACAACTGCTG 
TGTTGACGAC 


50 


- 4651 


ACGCCGCTGC 
TGCGGCGACG 


GCGATCAGTT CACCCGTGCA CCGCTGGATA ACGACATTGG 
CGCTAGTCAA GTGGGCACGT GGCGACCTAT TGCTGTAACC 
LacZ 


4701 


CGTAAGTGAA 
GCATTCACTT 


GCGACCCGCA 
CGCTGGGCGT 


TTGACCCTAA 
AACTGGGATT 
LacZ 


CGCCTGGGTC 
GCGGACCCAG 


GAACGCTGGA 
CTTGCGACCT 



55 ' 4T51 AGGCGGCGGG CCATTACCAG GCCGAAGCAG CGTTGTTGCA GTGCACGGCA 



TCCGCCGCCC GGTAATGGTC CGGCTTCGTC GCAACAACGT CACGTGCCGT 
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LacZ 



4801 


GATACACTTG 
CTATGTGAAC 


CTGATGCGGT GCTGATTACG ACCGCTCACG CGTGGCAGCA 
GACTACGCCA CGACTAATGC TGGCGAGTGC GCACCGTCGT 
LacZ 


4851 


TCAGGGGAAA ACCTTATTTA TCAGCCGGAA AACCTACCGG ATTGATGGTA 
AGTCCCCTTT TGGAATAAAT AGTCGGCCTT TTGGATGGCC TAACTACCAT 

LacZ 


4901 


GTGGTCAAAT 
CACCAGTTTA 


GGCGATTACC GTTGATGTTG AAGTGGCGAG CGATACACCG 
CCGCTAATGG CAACTACAAC TTCACCGCTC GCTATGTGGC 
LacZ 


4951 


CATCCGGCGC 
GTAGGCCGCG 


GGATTGGCCT GAAGTGCCAG CT.GGCGCAGG TAGCAGAGCG 
CCTAACCGGA CTTGACGGTC GACCGCGTCC ATCGTCTCGC 
LacZ 


5001/ 


GGTAAACTGG 
CCATTTGACC 


CTCGGATTAG GGCCGCAAGA AAACTATCCC GACCGCCTTA 
GAGGCTAATC CCGGCGTTCT TTTGATAGGG CTGGCGGAAT 
LacZ 


5051 


CTGCCGCCTG 
' GACGGCGGAC 


TTTTGACCGC TGGGATCTGC CATTGTCAGA CATGTATACC 
AAAACTGGCG ACCCTAGACG GTAACAGTCT GTACATATGG 
LacZ 


5101 


CCGTACGTCT 
GGCATGCAGA 


TCCCGAGCGA AAACGGTCTG CGCTGCGGGA. CGCGCGAATT 
AGGGCTCGCT TTTGCCAGAC GCGACGCCCT -GCGCGCTTAA 
LacZ 


5151 • 


GAATTATGGC 
CTTAATACCG 


CCACACCAGT GGCGCGGCGA CTTCCAGTTC AACATCAGCC 
GGTGTGGTCA CCGCGCCGCT GAAGGTCAAG TTGTAGTCGG 
LacZ 


5901 


GGTACAGTCA ACAGCAATTG ATGGAAACCA GCCATTCGCC ATCTGCTGCA 
CCATGTCAGT TGTCGTTAAC TACCTTTGGT CGGTAAGCGG TAGACGACGT 

LacZ 


5251 


CGCGGAAGAG 
GCGCCTTCTC . 


GCACATGGCT GAATATCGAC GGTTTCCATA TGGGGATTGG 
CGTGTACCGA CTTATAGCTG CCAAAGGTAT ACCCCTAACC 
LacZ 


5301 ' 


TGGCGACGAC 
ACCGCTGCTG 


TCCTGGAGCC CGTCAGTATC GGCGGAATTC CAGCTGAGCG 
AGGACCTCGG GCAGTCATAG CCGCCTTAAG GTCGACTCGC 
LacZ 



5351 CCGGTCGCTA CCATTACCAG TTGGTCTGGT GTCAAAAATA ATAATAACCG 
GGCCAGCGAT GGTAATGGTC AACCAGACCA CAGTTTTTAT TATTATTGGC 

5401 GGCAGGGGGG ATCCGGAGCT TATCGCAGAT CAATTCGATA TCAAGCTTAT 
CCGTCCCCCC TAGGCCTCGA ATAGCGTCTA GTTAAGCTAT AGTTCGAATA 



H6 Promoter 

5451 CGATACCGTC GACGGTATCG AT7VAGCTCTA GTGGAGGGTT CTTTATTCTA 
GCTATGGCAG CTGCCATAGC TATTCGAGAT CACCTCCCAA GAAATAAGAT 
H6 Promoter 
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5501 TACTTAAAAA GTGAAAATAA ATACAAAGGT TCTTGAGGGT TGTGTTAAAT 
ATGAATTTTT CACTTTTATT TATGTTTCCA AGAACTCCCA ACACAATTTA 
H6 Promoter . 



5 ■ 


5.551 


TGAAAGCGAG AAATAATCAT AAATTATTTC ATTATCGCGA 
ACTTTCGCTC TTTATTAGTA TTTAATAAAG TAATAGCGCT 
H6 Promoter NYESO-1 


TATCCGTTAA 
ATAGGCAATT 


10 


5601 


GTTTGTATCG 
CAAACATAGC 


TACCCCCCCC GAGCCATGCA GGCCGAAGGC 
ATGGGGGGGG CTCGGTACGT CCGGCTTCCG 
NYESO-1 


CGGGGCACAG 
GCCCCGTGTC 


15 


5651 ' 


GGGGTTCGAC GGGCGATGCT GATGGCCCAG GAGGCCCTGG CATTCCTGAT 
CCCCAAGCTG CCCGCTACGA CTACCGGGTC CTCCGGGACC GTAAGGACTA 

NYESO-1 


20 


5701 


GGCCCAGGGG 
CCGGGTCCCC 


GCAATGCTGG CGGCCCAGGA GAGGCGGGTG 
CGTTACGACC GCCGGGTCGT CTCCGCCCAC 
NYESO-1 


CCACGGGCGG 
GGTGCCCGCC 


5751 


CAGAGGTCCC 
GTCTCCAGGG 


CGGGGCGCAG GGGCAGCAAG GGCCTCGGGG 
GCCCCGCGTC CCCGTCGTTC CCGGAGCCCC 
NYESO-1 


CCGGGAGGAG 
GGCCCTCCTC 


25 


, 580.1- 


GCGCCCCGCG 
CGCGGGGCGC 


GGGTCCGCAT GGCGGCGCGG CTTCAGGGCT 
CCCAGGCGTA CCGCCGCGCC GAAGTCCCGA 
NYESO-1 


GAATGGATGC 
CTTACCTACG 


30 


5851 


TGCAGATGCG 
. ACGTCTACGC 


GGGCCAGGGG GCCGGAGAGC CGCCTGCTTG 
CCGGGTCCCC CGGCCTCTCG GCGGACGAAC 
NYESO-1 


AGTTCTACCT 
TCAAGATGGA 


35 


5901 


CGCCATGCCT 
GCGGTACGGA 


TTCGCGACAC CCATGGAAGC AGAGCTGGCC 
AAGCGCTGTG GGTACCTTCG TCTCGACCGG 
NYESO-1 


CGCAGGAGCC 
GCGTCCTCGG 


40 


5951. 


TGGCCCAGGA 
ACCGGGTCCT 


TGCCCCACCG CTTCCCGTGC CAGGGGTGCT 
ACGGGGTGGC GAAGGGCACG GTCCCCACGA 
NYESO-1 


TCTGAAGGAG 
AGACTTCCTC 


cnni 

DUUl 


TTCACTGTGT 
AAGTGACACA 


CCGGCAACAT ACTGACTATC CGACTGACTG 
GGCCGTTGTA TGACTGATAG GCTGACTGAC 
NYESO-1 ' 


CTGCAGACCA 
GACGTCTGGT 


45 


6051 


CCGCCAACTG CAGCTCTCCA TCAGCTCCTG TCTCCAGCAG CTTTCCCTGT 
GGCGGTTGAC GTCGAGAGGT AGTCGAGGAC AGAGGTCGTC GAAAGGGACA 

NYESO-1 


50 


6101 


TGATGTGGAT CACGCAGGTG TTTCTGCCCG TGTTTTTGGC 
ACTACACCTA GTGCGTCCAC AAAGACGGGC ACAAAAACCG 
NYESO-1 


TCAGCCTCCC 
AGTCGGAGGG 




6151 


TCAG5GCAGA GGCGCTAAGT AATTAATTTT TTTTTGGGCT 
AGTCCCGTCT CCGCGATTCA TTAATTAAAA AAAAACCCGA 


GCAGGATCGC 
CGTCCTAGCG 



55 
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sE/L Promoter 



5 


6201 


TAGCAAAAAT TGAAATTTTA TTTTTTTTTT TTGGAATATA AATAAGCTCG 
ATCGTTTTTA ACTTTAAAAT AAAAAAAAAA AACCTTATAT TTATTCGAGC 

hTRP-2 






sE/L Promoter 








10 


6251 


AAGCTCGAGC 
TTCGAGCTCG 


CATGAGCCCC 
GTACTCGGGG 


CTTTGGTGGG 
GAAACCACCC 
hTRP-2 


GGTTTCTGCT 
CCAAAGACGA 


CAGTTGCTTG 
GTCAACGAAC 


15 


6301 


GGCTGCAAAA 
CCGACGTTTT 


TCCTGCCAGG 
AGGACGGTCC 


AGCCCAGGGT 
TCGGGTCCCA 
hTRP-2 


CAGTTCCCCC 
GTCAAGGGGG 


GAGTCTGCAT 
CTCAGACGTA 


20 


6351 


GACGGTGGAC 
CTGCCACCTG 


AGCCTAGTGA 
TCGGATCACT 


ACAAGGAGTG 
TGTTCCTCAC 
hTRP-2 


CTGCCCACGC 
GACGGGTGCG 


CTGGGTGCAG 
GACCCACGTC 


64.01 


AGTCGGCCAA 
TCAGCCGGTT 


TGTCTGTGGC 
ACAGACACCG 


TCTCAGCAAG 
AGAGTCGTTC 
hTRP-2 


GCCGGGGGCA 
CGGCCCCCGT 


GTGCACAGAG- 
CACGTGTCTC 


25 


6451 


GTGCGAGCCG 
CACGCTCGGC 


ACACAAGGCC 
TGTGTTCCGG 


CTGGAGTGGT 
GACCTCACCA 
hTRP-2 


CCCTACATCC 
GGGATGTAGG 


TACGAAACCA 
ATGCTTTGGT 


30 


6501 


GGATGACCGT 
CCTACTGGCA 


GAGCTGTGGC 
CTCGACACCG 


CAAGAAAATT 
GTTCTTTTAA 
hTRP-2 


CTTCCACCGG 
GAAGGTGGCC 


ACCTGCAAGT 
TGGACGTTCA 


35 


6551 


GCACAGGAAA 
CGTGTCCTTT. 


CTTTGCCGGC 
GAAACGGCCG 


TATAATTGTG 
ATATTAACAC 
hTRP-2 


GAGACTGCAA 
CTCTGACGTT 


GTTTGGCTGG 
CAAACCGACC 


40 


6601 


ACCGGTCCCA 
TGGCCAGGGT 


ACTGCGAGCG 
TGACGCTCGC 


GAAGAAACCA 
CTTCTTTGGT 
hTRP-2 


CCAGTGATTC 
GGTCACTAAG 


GGCAGAACAT 
CCGTCTTGTA 


6651 


■CCATTCCTTG 
GGTAAGGAAC 


AGTCCTCAGG 
TCAGGAGTCC 


AAAGAGAGCA 
TTTCTCTCGT 
hTRP-2 


GTTCTTGGGC 
CAAGAACCCG 


GCCTTAGATC 
CGGAATCTAG . 


45 


6701 


TCGCGAAGAA 
AGCGCTTCTT 


GAGAGTACAC 
CTCTCATGTG 


CCCGACTACG 
GGGCTGATGC 
hTRP-2 


TGATCACCAC 
ACTAGTGGTG 


ACAACACTGG 
TGTTGTGACC 


50 


6751 


CTGGGCCTGC 
GACCCGGACG 


TTGGGCCCAA TGGAACCCAG 
AACCCGGGTT ACCTTGGGTC 
hTRP-2 • 


CCGCAGTTTG 
GGCGTCAAAC 


CCAACTGCAG 
GGTTGACGTC 




6801 


TGTTTATGAT 
ACAAATACTA 


TTCTTCGTGT 
AAGAAGCACA 


GGCTCCATTA 
CCGAGGTAAT 


TTATTCTGTT 
AATAAGACAA 


AGAGATACAT 
TCTCTATGTA 
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hTRP-2 . 



6851 TATTAGGACC AGGACGCCCC TACAGGGCCA TAGATTTCTC ACATCAAGGA 
ATAATCCTGG TCCTGCGGGG ATGTCCCGGT ATCTAAAGAG TGTAGTTCCT 

hTRP-2 



6901 CCTGCATTTG TTACCTGGCA CCGGTACCAT TTGTTGTGTC TGGAAAGAGA 
GGACGTAAAC AATGGACCGT GGCCATGGTA AACAACACAG ACCTTTCTCT 

hTRP-2 



■ 6951 TCTCCAGCGA CTCATTGGCA ATGAGTGTTT TGCTTTGCCC TACTGGAACT 
AGAGGTCGCT GAGTAACCGT TACTCAGAAA ACGAAACGGG ATGACCTTGA 

hTRP-2 . 



7001 TTGCCACTGG GAGGAACGAG TGTGATGTGT GTACAGACCA GCTGTTTGGG 
AACGGTGACC CTCCTTGCTC ACACTACACA CATGTCTGGT CGACAAACCC 

hTRP-2 



7051 GCAGCGAGAC CAGACGATCC GACTCTGATT AGTCGGAACT CAAGATTCTC 
CGTCGCTCTG GTCTGCTAGG CTGAGACTAA TCAGCCTTGA' GTTCTAAGAG 

hTRP-2 



7101 CAGCTGGGAA ACTGTCTGTG ATAGCTTGGA TGACTACAAC CACCTGGTCA 
GTCGACCCTT TGACAGACAC TATCGAACCT ACTGATGTTG GTGGACCAGT 

hTRP-2 



7151 CCTTGTGCAA TGGAACCTAT. GAAGGTTTGC TGAGAAGAAA. TCAAATGGGA 
GGAACACGTT ACCTTGGATA CTTCCAAACG ACTCTTCTTT AGTTTACCCT 

hTRP-2 



7201 AGAAACAGCA TGAAATTGCC AACCTTAAAA G AC AT AC GAG ATTGCCTGTC 
TCTTTGTCGT ACTTTAACGG TTGGAATTTT CTGTATGCTC TAACGGACAG 

hTRP-2 



7251' TCTCCAGAAG TTTGACAATC CTCCCTTCTT CCAGAACTCT ACCTTCAGTT 
AGAGGTCTTC AAACTGTTAG GAGGGAAGAA GGTCTTGAGA TGGAAGTCAA 

hTRP-2 



7301 TCAGGAATGC TTTGGAAGGG TTTGATAAAG CAGATGGGAC TCTGGATTCT 
AGTCCTTACG AAACCTTCCC AAACTATTTC GTCTACCCTG AGACCTAAGA 

hTRP-2 



7351 CAAGTGATGA GCCTTCATAA TTTGGTTCAT TCCTTCCTGA ACGGGACAAA 
GTTCACTACT CGGAAGTATT AAACCAAGTA AGGAAGGACT TGCCCTGTTT 

hTRP-2 



7401 CGCTTTGCCA CATTCAGCCG CCAATGATCC CATCTTCGTG GTGATTTCTA 
GCGAAACGGT GTAAGTCGGC GGTTACTA'GG GTAGAAGCAC CACTAAAGAT 

hTRP-2 



7451 ATCGTTTGCT* TTACAATGCT ACAACAAACA TCCTTGAACA TGTAAGAAAA 
TAGCAAACGA AATGTTACGA TGTTGTTTGT AGGAACTTGT ACATTCTTTT 

hTRP-2 



7501 ■ GAGAAAGCGA CCAAGGAACT CCCTTCCCTG CATGTGCTGG TTCTTCATTC 
CTCTTTCGCT GGTTCCTTGA GGGAAGGGAC GTACACGACC AAGAAGTAAG 
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hTRP-2 



5 


7551 


CTTTACTdAT 
GAAATGACTA 


GCCATCTTTG ATGAGTGGAT GAAAAGATTT AATCCTCCTG 
CGGTAGAAAC TACTCACCTA CTTTTCTAAA TTAGGAGGAC 
hTRP-2 


10 


7601 


CAGATGCCTG 
GTCTACGGAC 


GCCTCAGGAG CTGGCCCCTA TTGGTCACAA TCGGATGTAC 
CGGAGTCCTC GACCGGGGAT AACCAGTGTT AGCCTACATG 
hTRP-2 


7651 


. AACATGGTTC 
TTGTACCAAG 


CTTTCTTCCC TCCAGTGACT AATGAAGAAC TCTTTTTAAC 
GAAAGAAGGG AGGTCACTGA TTACTTCTTG AGAAAAATTG 
hTRP-2 


15 


7701 


CTCAGACCAA CTTGGCTACA GCTATGCCAT CGATCTGCCA GTTTCAGTTG 
GAGTCTGGTT GAACCGATGT CGATACGGTA GCTAGACGGT CAAAGTCAAC 

hTRP-2 


20 . 


7751 


AAGAAACTCC 
TTCTTTGAGG 


AGGTTGGCCC ACAACTCTCT TAGTAGTCAT GGGAACACTG 
TCCAACCGGG TGTTGAGAGA ATCATCAGTA CCCTTGTGAC 
hTRP-2 


25 


7801 


GTGGCTTTGG TTGGTCTGTT CGTGCTGTTG GCTTTTCTTC AATATAGAAG 
CACCGAAACC AACCAGACAA GCACGACAAC CGAAAAGAAG TTATATCTTC 

hTRP-2 


30 


7851 


ACTTCGAAAA GGATATACAC CCC.TAATGGA GACACATTTA AGCAGCAAGA 
TGAAGCTTTT CCTATATGTG GGGATTACCT CTGTGTAAAT TCGTCGTTCT 
hTRP-2 


7901 


GATACACAGA 
CTATGTGTCT 


AGAAGCCTAG TTTTTTAATT AAGCATGC.TC TAGAATCGAT 
TCTTCGGATC AAAAAATTAA TTCGTACGAG ATCTTAGCTA 

C5 Left Arm 


35 


7951 


CCCGGGTTTT 
GGGCCCAAAA 


TATGACTAGT TAATCACGGC CGCTTATAAA GATCTAAAAT 
ATACTGATCA ATTAGTGCCG GCGAATATTT CTAGATTTTA 
C5, Left Arm 


40 


8001 


GCATAATTTC 
CGTATTAAAG 


TAAATAATGA AAAAAAAGTA CATCATGAGC AACGCGTTAG. 
ATTTATTACT TTTTTTTCAT GTAGTACTCG TTGCGCAATC 
C5 Left Arm 


45 


8051 


TATATTTTAC 
ATATAAAATG 


AATGGAGATT AACGCTCTAT ACCGTTCTAT GTTTATTGAT 
TTACCTCTAA TTGCGAGATA TGGCAAGATA CAAATAACTA 
C5 Left Arm 


50 


8101 


TCAGATGATG 
AGTCTACTAC 


TTTTAGAAAA GAAAGTTATT GAATATGAAA ACTTTAATGA 
AAAATCTTTT CTTTCAATAA CTTATACTTT TGAAATTACT 
C5 Left Arm 


8151 


AGATGAAGAT 
TCTACTTCTA 


GACGACGATG ATTATTGTTG TAAATCTGTT TTAGATGAAG 
CTGCTGCTAC TAATAACAAC ATTTAGACAA AATCTACTTC 
C5. Left Arm 



55 " 8201 AAGATGACGC GCTAAAGTAT ACTATGGTTA CAAAGTATAA GTCTATACTA 



TTCTACTGCG CGATTTCATA TGATACCAAT GTTTCATATT CAGATATGAT 
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C5 Left Am 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 





GATTACCGCT 


C T T Wi bt- Aftb /u\o V? I AT AG T 
GAACACGTTC TTCCATATCA 
C5 Left Arm 


ATAGTGAAAA 
TATCACTTTT 


TGTTGTTAGA 
ACAACAATCT 


8301 


TTATGATTAT 
AATACTAATA 


GAAAAACCAA ATAAATCAGA 
CTTTTTGGTT TATTTAGTCT 
• C5 Left Arm 


TCCATATCTA 
AGGTATAGAT 


AAGGTATCTC 
TTCCATAGAG 


8351 


CTTTGCACAT 
GAAACGTGTA 


AATTTCATCT ATTCCTAGTT 
TTAAAGTAGA TAAGGATCAA 
C5 Left Arm 


TAGAATACTT 
ATCTTATGAA 


TTCATTATAT 
AAGTAATATA 



8401 

8451 ■ 
. 8501 
8551 
860.1 
8651 
8701 
8751 
8801 
8851 
8901 
8951 
9001 
9051 

9101 

9151 

9201 



TTGTTTACAG CTGAAGACGA AAAAAATATA TCGATAATAG AAGATTATGT 
AACAAATGTC GACTTCTGCT TTTTTTATAT AGCTATTATC TTCTAATACA 
C5 Left Arm' 



* TAACTCTGCT 
ATTGAGACGA 
TTGGCACTGG 
AACCGTGACC 
TACCCAACTT 
' ATGGGTTGAA 
ATAGCGAAGA 
TATCGCTTCT 
AATGGCGAAT 
TTACCGCTTA 
. TATTTCACAC 
ATAAAGTGTG 
CATAGTTAAG 
GTATCAATTC 
CGGGCTTGTC 
GCCCGAACAG 
CGGGAGCTGC 
GCCCTCGACG 
GACGAAAGGG 
CTGCTTTCCC 
ATAATGGTTT 
TATTACCAAA 
AACCCCTATT 
TTGGGGATAA 
TGAGACAATA" 
ACTCTGTTAT 



AATAAGATGA 
TTATTCTACT 
. CCGTCGTTTT 
GGCAGCAAAA 
AATCGCCTTG 
TTAGCGGAAC 
GGCCCGCACC 
CCGGGCGTGG 
GGCGCCTGAT 
CCGCGGACTA 
CGCATATGGT 
GCGTATACCA 
CCAGCCCCGA 
GGTCGGGGCT 
TGCTCCCGGC 
ACGAGGGCCG 
ATGTGTCAGA 
TACACAGTCT 
CCTCGTGATA 
GGAGCACTAT 
CTTAGACGTC 
GAATCTGCAG 
TGTTTATTTT 
ACAAATAAAA 
ACCCTGATAA 
TGGGACTATT 



AATTGAATGA 

TTAACTTACT 

ACAACGTCGT 

TGTTGCAGCA 

CAGCACATCC 

GTCGTGTAGG 

GATCGCCCTT 

CTAGCGGGAA 

GCGGTATTTT 

CGCCATAAAA 

GCACTCTCAG 

CGTGAGAGTC 

CACCCGCCAA 

GTGGGCGGTT 

ATCCGCTTAC 

TAGGCGAATG 

GGTTTTCACC 

CCAAAAGTGG . 

CGCCTATTTT 

GCGGATAAAA 

AGGTGGCACT 

TCCACCGTGA 

TCTAAATACA 

AGATTTATGT 

ATGCTTCAAT 

TACGAAGTTA 

Amp (R) 



GTCTGTGACT 

CAGACACTGA 

GACTGGGAAA 

CTGACCCTTT 

CCCTTTCGCC 

GGGAAAGCGG 

CCCAACAGTT 

GGGTTGTCAA 

CTCCTTACGC 

GAGGAATGCG 

TACAATCTGC 

ATGTTAGACG 

CACCCGCTGA 

GTGGGCGACT 

AGACAAGCTG 

TCTGTTCGAC 

GTCATCACCG 

CAGTAGTGGC 

TATAGGTTAA 

ATATCCAATT 

TTTCGGGGAA 

AAAGCCCCTT 

TTCAAATATG 

AAGTTTATAC 

AATATTGAAA 

TTATAACTTT 



GCAGCCAAGC 

CGTCGGTTCG 

ACCCTGGCGT 

TGGGACCGCA 

AGCTGGCGTA 

TCGACCGCAT 

GCGCAGCCTG 

CGCGTCGGAC 

ATCTGTGCGG 

TAGACACGCC 

TCTGATGCCG 

AGACTACGGC 

CGCGCCCTGA 

GCGCGGGACT 

TGACCGTCTC 

ACTGGCAGAG 

AAACGCGCGA 

TTTGCGCGCT 

TGTCATGATA 

ACAGTACTAT 

ATGTGCGCGG 

TACACGCGCC 

TATCCGCTCA 

ATAGGCGAGT 

AAGGAAGAGT 

TTCCTTCTCA 



ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 
TACTCATAAG TTGTAAAGGC ACAGCGGGAA TAAGGGAAAA AACGCCGTAA 

Amp (R) 

TTGCCTTCCT' GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 
AACGGAAGGA CAAAAACGAG TGGGTCTTTG CGACCACTTT CATTTTCTAC 

Amp(R) 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC 
GACTTCTAGT CAACCCACGT GCTCACCCAA TGTAGCTTGA CCTAGAGTTG 
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Amp(R) 



5 


9251 


AGQGGTAAGA TCCTTGAGAG ■ TTTTCGCCCC GAAGAACGTT 
TCGCCATTCT AGGAACTCTC AAAAGCGGGG CTTCTTGCAA 

Amp (R) ' 


TTCCAATGAT 
AAGGTTACTA 


10 


9301* 


GAGCACTTTT AAAGTTCTGC TATGTGGCGC GGTATTATCC CGTATTGACG 
CTCGTGAAAA TTTCAAGACG ATACACCGCG CCATAATAGG GCATAACTGC 

Amp (R) 


9351 


. CCGGGCAAGA 
GGCCCGTTCT 


GCAACTCGGT 
CGTTGAGCCA 


CGCCGCATAC 
GCGGCGTATG 
Amp(R) 


ACTATTCTCA 
TGATAAGAGT 


GAATGACTTG 
CTTACTGAAC 


15 


9401' 


GTTGAGTACT 
CAACTCATGA 


CACCAGTCAC AGAAAAGCAT 
GTGGTCAGTG TCTTTTCGTA 
Amp (R) 


CTTACGGATG 
GAATGCCTAC 


GCATGACAGT 
CGTACTGTCA 


20 . 


9451 


AAGAGAATTA 
TTCTCTTAAT 


TGCAGTGCTG 
ACGTCACGAC 


CCATAACCAT 
GGTATTGGTA 
Amp (R) 


GAGTGATAAC 
CTCACTATTG 


ACTGCGGCCA 
TGACGCCGGT 


25 


9501 


ACTTACTTCT 
TGAATGAAGA 


GACAACGATC 
CTGTTGCTAG 


GGAGGACCGA 
CCTCCTGGCT 
Amp (R) 


AGGAGCTAAC 
TCCTCGATTG 


CGCTTTTTTG' 
GCGAAAAAAC 


30 


9551 


CACAACATGG 
GTGTTGTACC 


GGGATCATGT 
CCCTAGTACA 


AAC.TCGCCTT 
TTGAGCGGAA 
Amp (R) 


GATCGTTGGG 
CTAGCAACCC 


AACCGGAGCT 
TTGGCCTCGA 


9601 


GAATGAAGCC 
CTTACTTCGG 


ATACCAAACG 
TATGGTTTGC 


ACGAGCGTGA 
TGCTCGCACT 
Amp (R) 


CACCACGATG 
GTGGTGCTAC 


CCTGTAGCAA 
GGACATCGTT 


35 


9651 ' 


TGGCAACAAC 
ACCGTTGTTG 


GTTGCGCAAA 
CAACGCGTTT 


CTATTAACTG 
GATAATTGAC 
Amp (R) 


GCGAACTACT 
CGCTTGATGA 


TACTCTAGCT 
ATGAGATCGA 


40 


9701 


TCCCGGCAAC 
AGGGCCGTTG 


AATTAATAGA 
TTAATTATCT 


CTGGATGGAG 
GACCTACCTC 
Amp (R) 


GCGGATAAAG 
CGCCTATTTC 


TTGCAGGACC. 
AACGTCCTGG 


45 


9751 


ACTTCTGCGC 
TGAAGACGCG 


TCGGCCCTTC 
AGCCGGGAAG 


CGGCTGGCTG 
GCCGACCGAC 
Amp (R) 


GTTTATTGCT 
CAAATAACGA 


GATAAATCTG 
CTATTTAGAC 


'50 


9801 


GAGCCGGTGA 
CTCGGCCACT 


GCGTGGGTCT 
CGCACCCAGA 


CGCGGTATCA 
GCGCCATAGT 
Amp (R) 


TTGCAGCACT 
AACGTCGTGA 


GGGGCCAGAT 
CCCCGGTCTA 


9851 


GGTAAGCCCT 
CCATTCGGGA 


CCCGTATCGT AGTTATCTAC 
GGGCATAGCA TCAATAGATG 
. Amp(R) 


ACGACGGGGA 
TGCTGCCCCT 


GTCAGGCAAC 
CAGTCCGTTG 



55 ' . 9901 TATGGATGAA CGAAATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA 



ATACCTACTT GCTTTATCTG TCTAGCGACT CTATCCACGG AGTGACTAAT 



29/53 



WO 2005/026370 



PCT/US2004/028751 



Amp (R) 



10 



9951 
10001 
10051 
10101 
10151 
10201 
15 10251 
10301 
10351 

20 

10401 
10451 
25 10501 
10551 
10601 

30 

10651 
10701 
35 10751 
10801 
10851 

40 

10901 
10951 
45 11001 
11051 
11101 



50 



11151 



AGCATTGGTA 
TCGTAACCAT 
TTAAAACTTC 
AATTT.TGAAG 
TAATCTCATG 
ATTAGAGTAC 
CAGACCCCGT 
GTCTGGGGCA 
CGCGTAATCT- 
GCGCATTAGA 
TTGTTTGCCG 
AACAAACGGC 
TCAGCAGAGC 
AGTCGTCTCG 
GGCCACCACT 
CCGGTGGTGA 
AATCCTGTTA 
TTAGGACAAT 
GGTTGGACTC 
CCAAfcCTGAG 
ACGGGGGGTT 
TGCCCCCCAA 
ACTGAGATAC 
TGACTCTATG 
GGAGAAAGGC 
CCTCTTTCCG 
CGCACGAGGG 
GCGTGCTCCC 
CGGGTTTCGC 
GCCCAAAGCG 
GGGGGCGGAG 
CCCCCGCCTC 
CTGGCCTTTT 
GACCGGAAAA 
TGATTCTGTG 
ACTAAGACAC 
GCCGCAGCCG 
CGGCGTCGGC 
■GAGCGCCCAA 
CTCGCGGGTT 
ATGCAGCTGG 
TACGTCGACC 
ACGCAATTAA 
" TGCGTTAATT 
TTTATGCTTC 
AAATACGAAG 
TCACACAGGA 
AGTGTGTCCT 
ATTCTAAG 



ACTGTCAGAC 
TGACAGTCTG 
ATTTTTAATT 
TAAAAATTAA 
ACCAAAATCC 
TGGTTTTAGG 
AGAAAAGATC 
TCTTTTCTAG 
GCTGCTTGCA 
CGACGAACGT 
GATCAAGAGC 
CTAGTTCTCG 
GCAGATACCA 
CGTCTATGGT 
TCAAGAACTC 
AGTTCTTGAG 
CCAGTGGCTG 
GGTCACCGAC 
AAGACGATAG 
TTCTGCTATC 
CGTGCACACA 
GCACGTGTGT 
CTACAGCGTG 
GATGTCGCAC 
GGACAGGTAT 
CCTGTCCATA 
AGCTTCCAGG 
TCGAAGGTCC 
CACCTCTGAC 
GTGGAGACTG 
CCTATGGAAA 
GGATACCTTT 
GCTGGCCTTT 
CGACCGGAAA 
GATAACCGTA 
CTATTGGCAT 
AACGACCGAG 
TTGCTGGCTC 
TACGCAAACC 
ATGCGTTTQG 
CACGACAGGT 
GTGCTGTCCA 
TGTGAGTTAG 
ACACTCAATC 
CGGCTCGTAT 
GCCGAGCATA 
AACAGCTATG 
TTGTCGATAC- 



CAAGTTTACT 
GTTCAAATGA 
TAAAAGGATC 
ATTTTCCTAG 
CTTAACGTGA 
GAATTGCACT 
AAAGGATCTT 
TTTCCTAGAA 
AACAAAAAAA 
TTGTTTTTTT 
TACCAACTCT 
ATGGTTGAGA 
AATACTGTCC 
TTATGAGAGG 
TGTAGCACCG 
ACATCGTGGC 
CTGCCAGTGG 
GACGGTCACC 
TTACCGGATA 
AATGGCCTAT 
GCCCAGCTTG 
CGGGTCGAAC 
AGCTATGAGA 
TCGATACTCT 
CCGGTAAGCG 
GGCCATTCGC 
GGGAAACGCC 
CCCTTTGCGG 
TTGAGCGTCG 
AACTCGCAGC 
AACGCCAGCA 
TTGCGGTCGT 
TGCTCACATG 
ACGAGTGTAC 
TTACCGCCTT 
AATGGCGGAA 
CGCAGCGAGT 
GCGTCGCTCA 
GCCTCTCCCC 
CGGAGAGGGG 
TTCCCGACTG 
AAGGGCTGAC 
CTCACTCATT 
GAGTGAGTAA 
GTTGTGTGGA 
CAACACACCT 
ACCATGATTA 
TGGTACTAAT 



CATATATACT 
GTATATATGA 
TAGGTGAAGA 
ATCCACTTCT 
GTTTTCGTTC 
CAAAAGCAAG 
CTTGAGATCC 
GAACTCTAGG 
CCACCGCTAC 
GGTGGCGATG 
TTTTCCGAAG 
AAAAGGCTTC 
TTCTAGTGTA 
AAGATCACAT 
CCTACATACC 
GGATGTATGG 
CGATAAGTCG 
GCTATTCAGC 
AGGCGCAGCG 
TCCGCGTCGC 
GAGCGAACGA 
CTCGCTTGCT 
AAGCGCCACG 
TTCGCGGTGC 
GCAGGGTCGG 
CGTCCCAGCC 
TGGTATCTTT 
ACCATAGAAA 
ATTTTTGTGA 
TAAAAACACT 
ACGCGGCCTT 
TGCGCCGGAA 
TTCTTTCCTG 
AAGAAAGGAC 
TGAGTGAGCT 
ACTCACTCGA 
CAGTGAGCGA 
GTCACTCGCT 
GCGCGTTGGC 
CGCGCAACCG 
GAAAGCGGGC 
CTTTCGCCCG 
AGGCACCCCA 
TCCGTGGGGT 
ATTGTGAGCG 
TAACACTCGC 
CGAATTGAAT 
GCTTAACTTA 



TTAGATTGAT 
AATCTAACTA 
TCCTTTTTGA 
AGGAAAAACT 
CACTGAGCGT 
GTGACTCGCA 
TTTTTTTCTG 
AAAAAAAGAC 
CAGCGGTGGT 
GTCGCCACCA 
GTAACTGGCT 

cattgaccga 
gccgtagtta 
cggcatcaat 
tcgctctgct 
agcgagacga 
tgtGttaccg 
acagaatggc 
gtcgggctga. 
cagcccgact 
cctacaccga 
ggatgtggct 
cttcccgaag 
gaagggcttc 
aacaggagag 

TTGTCCTCTC 

ATAGTCCTGT 

TATCAGGACA 

TGCTCGTCAG 

ACGAGCAGTC 

TTTACGGTTC 

AAATGCCAAG 

CGTTATCCCC 

GCAATAGGGG 

GATACCGCTC 

CTATGGCGAG 

GGAAGCGGAA . 

CCTTCGCCTT 

CGATTCATTA 

■GCTAAGTAAT . 

AGTGAGCGCA 

TCACTCGCGT 

GGCTTTACAC 

CCGAAATGTG 

GATAACAATT 

CTATTGTTAA 

TGCGGCCGCA 

ACGCCGGCGT- 



30/53 



WO 2005/026370 



PCT/US2004/028751 



FIGUR E* 



C5 Right Arm 

/ 




31/53 



WO 2005/026370 



PCT7US2004/028751 



FIGURES 
DNA Sequence of donor plasmid pT3217 

C5 Right Arm 



5 


1 


TGAATGTTAA 
. ACTTACAATT 


ATGTTATACT TTGGATGAAG 
TACAATATGA AACCTACTTC 
C5 Right Arm 


CTATAAATAT GCATTGGAAA 
GATATTTATA CGTAACCTTT 


10 


51 


AATAATCCAT 
" TTATTAGGTA 


TTAAAGAAAG GATTCAAATA 
AATTTCTTTC CTAAGTTTAT 
C5 Right Arm 


CTACAAAACC TAAGCGATAA 
GATGTTTTGG ATTCGCTATT 


15 


. ioi 


TATGTTAACT 
ATACAATTGA 


AAGCTTATTC TTAACGACGC TTTAAATATA CACAAATAAA 
TTCGAATAAG AATTGCTGCG AAATTTATAT GTGTTTATTT 
C5 Right Arm 


20 


151 


CATAATTTTT 
GTATTAAAAA 


GTATAACCTA ACAAATAACT 
CATATTGGAT TGTTTATTGA 
C5 Right Arm 


AAAACATAAA AATAATAAAA 
TTTTGTATTT TTATTATTTT 


25 


' 201 


GGAAATGTAA 
CCTTTACATT 


TATCGTAATT ATTTTACTCA 
ATAGCATTAA TAAAATGAGT 
C5 Right Arm 


GGAATGGGGT TAAATATTTA 
CCTTACCCCA ATTTATAAA.T 


251 


TATCACGTGT 
ATAGTGCACA 


ATATCTATAC TGTTATCGTA 
TATAGATATG, ACAATAGCAT 
C5 Right Arm 


TACTCTTTAC AATTACTATT 
ATGAGAAATG TTAATGATAA 


30 


• 301 


ACGAATATGC 
TGCTTATACG 


AAGAGATAAT AAGATTACGT 
TTCTCTATTA TTCTAATGCA 
C5 Right Arm 


ATTTAAGAGA ATCTTGTCAT 
TAAATTCTCT TAGAACAGTA 


35 


351 


GATAATTGGG TACGACATAG TGATAAATGC TATTTCGCAT CGTTACATAA 
CTATTAACCC ATGCTGTATC ACTATTTACG ATAAAGCGTA GCAATGTATT 
C5 Right Arm 


40 


401 


AGTCAGTTGG 
TCAGTCAACC 


AAAGATGGAT TTGACAGATG 
TTTCTACCTA AACTGTCTAC 
C5 Right Arm 


TAACTTAATA GGTGCAAAAA 
ATTGAATTAT CCACGTTTTT 


45 


451 


TGTTAAATAA 
ACAATTTATT 


CAGCATTCTA TCGGAAGATA 
GTCGTAAGAT AGCCTTCTAT 
C5 Right Arm 


GGATACCAGT TATATTATAC 
CCTATGGTCA ATATAATATG 


501 


AAAAATCACT 
TTTTTAGTGA 


GGTTGGATAA AACAGATTCT 
CCAACCTATT TTGTCTAAGA 
' C5 Right Arm 


GCAATATTCG TAAAAGATGA 
CGTTATAAGC ATTTTCTACT 


50 


551 


AGATTACTGC 
TCTAATGACG 


GAATTTGTAA ACTATGACAA 
CTTAAACATT TGATACTGTT 


TAAAAAGCCA TTTATCTCAA 
ATTTTTCGGT AAATAGAGTT 
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C5 Right Arm 



5 


601 


CGACATCGTG 
GCTGTAGCAC 


TAATTCTTCC ATGTTTTATG 
ATTAAGAAGG TACAAAATAC 
C5 Right Arm 


TATGTGTTTC AGATATTATG 
ATACACAAAG TCTATAATAC 


.10 


651 


AGATTACTAT 
TCTAATGATA 


AAACTTTTTG TATACTTATA 
TTTGAAAAAC ATATGAATAT 
• C5 Right Arm 


TTCCGTAAAC 
AAGGCATTTG 


TATATTAATC 
ATATAATTAG 


• 701 


ATGAAGAAAA 
TACTTCTTTT 


TGAAAAAGTA TAGAAGCTGT TCACGAGCGG 
ACTTTTTCAT AT.CTTCGACA AGTGCTCGCC 
C5 Right Arm 


TTGTTGAAAA 
AACAACTTTT 


15 


751 


CAACAAAATT 
GTTGTTTTAA 


ATACATTCAA GATGGCTTAC 
TATGTAAGTT CTACCGAATG 
C5 Right Arm 


ATATACGTCT 
TATATGCAGA 


GTGAGGCTAT 
CACTCCGATA 


20 


801 


CATGGATAAT 
GTACCTATTA 


GACAATGCAT CTCTAAATAG GTTTTTGGAC 
CTGTTACGTA GAGATTTATC CAAAAACCTG 
C5 Right Arm 


AATGGATTCG 
TTACCTAAGC 


25 


'851 


ACCCTAACAC 
TGGGATTGTG 


GGAATATGGT ACTCTAGAAT CTCCTCTTGA 
CCTTATACCA TGAGATGTTA GAGGAGAACT 
C5 Right Arm 


AATGGCTGTA 
TTACCGACAT 


30 


901 


ATGTTCAAGA 
TACAAGTTCT 


ATACCGAGGC TATAAAAATC 
TATGGCTCCG ATATTTTTAG 
C5 Right Arm 


TTGATGAGGT 
AACTACTCCA 


ATGGAGCTAA 
TACCTCGATT 


951 


ACCTGTAGTT 
TGGACATCAA 


ACTGAATGCA CAACTTCTTG 
TGACTTACGT GTTGAAGAAC 
C5 Right Arm 


TCTGCATGAT 
AGACGTACTA 


GCGGTGTTGA 
CGCCACAACT 


35 


1001 


GAGACGACTA 
CTCTGCTGAT 


CAAAATAGTG AAAGATCTGT TGAAGAATAA CTATGTAAAC 
GTTTTATCAC TTTCTAGACA. ACTTCTTATT GATACATTTG 
C5 Right Arm. 


40 


' 1051 


AATGTTCTTT 
TTACAAGAAA 


ACAGCGGAGG CTTTACTCCT 
TGTCGCCTCC GAAATGAGGA 
C5 Right Arm 


TTGTGTTTGG 
AACACAAACC 


CAGCTTACCT 
GTCGAATGGA 


45 


1101 


TAACAAAGTT 
ATTGTTTCAA 


AATTTGGTTA AACTTCTATT 
TTAAACCAAT TTGAAGATAA 
C5 Right Arm 


GGCTCATTCG 
CCGAGTAAGC 


GCGGATGTAG 
CGCCTACATC 


50 


1151 


ATATTTCAAA 
TATAAAGTTT 


CACGGATCGG TTAACTCCTC 
GTGCCTAGCC AATTGAGGAG 
C5 Right. Arm 


TACATATAGG 
ATGTATATCG 


CGTATCAAAT 
GCATAGTTTA 


1201 


AAAAATTTAA' CAATGGTTAA ACTTCTATTG 
TTTTTAAATT GTTACCAATT TGAAGATAAC 
C5 Right Arm 


AACAAAGGTG 
TTGTTTCCAC 


CTGATACTGA 
GACTATGACT 



55 1251 ' CTTGCTGGAT AACATGGGAT GTACTCCTTT AATGATCGCT GTACAATCTG 



GAACGACCTA TTGTACCCTA CATGAGGAAA TTACTAGCGA CATGTTAGAC 
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10 



15 



20 



25 



30 



35 



40 



45 



50 







C5 Right Arm 




1301 


GAAATATTGA AATATGTAGC- ACACTACTTA AAAAAAATAA AATGTCCAGA 
CT.TTATAACT TTATACATCG TGTGATGAAT TTTTTTTATT TTACAGGTCT 
C5 Right Arm 


1351 


ACTGGGAAAA 
TGACCCTTTT 


ATTGATCTTG CCAGCTGTAA TTCATGGTAG AAAAGAAGTG 
TAACTAGAAC GGTCGACATT AAGTACCATC TTTTCTTCAC 
C5 Right Arm 


1401 


CTCAGGCTAC 
GAGTCCGATG 


TTTTCAACAA AGGAGCAGAT GTAAACTACA 
AAAAGTTGTT TCCTCGTCTA CATTTGATGT 
C5 Right Arm 


TCTTTGAAAG 
AGAAACTTTC 


1451 ' 


AAATGGAAAA 
TTTACCTTTT 


TCATATACTG TTTTGGAATT GATTAAAGAA AGTTACTCTG 
AGTATATGAC AAAACCTTAA CTAATTTCTT TCAATGAGAC 
C5 Right Arm 


1501 


AGACACAAAA 
TCTGTGTTTT 


GAGGTAGCTG AAGTGGTACT CTCAAAGGTA 
CTCCATCGAC TTCACCATGA GAGTTTCCAT 
Repeat Region 


CGTGACTAAT 
GCAGTGATTA 


1551 


TAGCTATAAA 
ATCGATATTT 


AAGGATCGGG TTCTTTATTC TATACTTAAA 
TTCCTAGCCC AAGAAATAAG ATATGAATTT 
Repeat Region 


AAGTGAAAAT 
TT.CACTTTTA 


1601 


AAATACAAAG 
TTTATGTTTC 


GTTCTTGAGG GTTGTGTTAA ATTGAAAGCG 
CAAGAACTCC CAACACAATT TAACTTTCGC 
Repeat Region 


AGAAATAATC 
TCTTTATTAG 


1651 


ATAAATTATT 
TATTTAATAA 


TCATTATCGC GATATCCGTT AAGTTTGTAT 
AGTAATAGCG CTATAGGCAA TTCAAACATA 
Repeat Region 


CGTAATCTGC 
GCATTAGACG 


1701 


AGCCCCCACC 
TCGGGGGTGG 


ATGGATCTGG TGCTAAAAAG ATGCCTTCTT 
TACCTAGACC ACGATTTTTC TACGGAAGAA 
Repeat Region 


CATTTGGCTG 
GTAAACCGAC 


1751 


TGATAGGTGC 
ACTATCCACG 


TTTGCTGGCT GTGGGGGCTA CAAAAGTACC 
AAACGACCGA CACCCCCGAT GTTTTCATGG 
Repeat Region 


cagaaaccag". 

GTCTTTGGTC 


1801 


GACTGGCTTG 
CTGACCGAAC 


GTGTCTCAAG GCAACTCAGA ACCAAAGCCT 
CACAGAGTTC CGTTGAGTCT TGGTTTCGGA 
Repeat Region 


GGAACAGGCA 
CCTTGTCCGT 


1851 


GCTGTATCCA 
CGACATAGGT 


GAGTGGACAG AAGCCCAGAG ACTTGACTGC 
CTCACCTGTC TTCGGGTCTC TGAACTGACG 
Repeat Region 


TGGAGAGGTG 
ACCTCTCCAC 


1901 


GTCAAGTGTC 
CAGTTCACAG 


CCTCAAGGTC AGTAATGATG GGCCTACACT 
GGAGTTCCAG TCATTACTAC CCGGATGTGA 
Repeat Region 


GATTGGTGCA 
CTAACCACGT 



55 ' 1951 AATGCCTCCT TCTCTATTGC CTTGAACTTC CCTGGAAGCC AAAAGGTATT 
TTACGGAGGA AGAGATAACG GAACTTGAAG GGACCTTCGG TTTTCCATAA 
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Repeat Region C1B promoter 



2001 


GCCAGATACT AGTTCTAGAG GATCATTATT TAACGTAAAC TAAATGGAAA 
CGGTCTATGA TCAAGATCTC CTAGTAATAA ATTGCATTTG ATTTACCTTT 
C1B promoter 


2051 • 


AGCTATTTAC AGGTACATAC GGTGTTTTTC 
TCGATAAATG TCCATGTATG CCACAAAAAG 
• C1B promoter 


TGGAATCAAA TGATTCTGAT 
ACCTTAGTTT ACTAAGACTA 


2101 


TTTGAGGATT TTATCAATAC AATAATGACA 
AAACTCCTAA AATAGTTATG TTATTACTGT. 

C1B promoter 


GTGCTAACTG 
CACGATTGAC 


GTAAAAAAGA 
CATTTTTTCT 


2151 


AAGCAAACAA TTATCATGGC TAACAATTTT 
TTCGTTTGTT AATAGTACCG ATTGTTAAAA 
C1B promoter 


TATTATATTT 
ATAATATAAA 


GTAGTATGCA 
CATCATACGT 


2201 ■ 


TAGTGGTCTT TACGTTTCTT TATTTAAAGT TAATGTGTTA AGATTAAATG 
ATCACCAGAA ATGCAAAGAA ATAAATTTCA ATTACACAAT ' TCTAATTTAC 
C1B promoter LacZ 


2251 


GAGTAATTGG ATCCCCCATC 
' CTCATTAACC TAGGGGGTAG 


GATGGGGAAT 
CTACCCCTTA 
LacZ 


TCACTGGCCG 
AGTGACCGGC 


TCGTTTTACA 
AGCAAAATGT 


2301 


ACGTCGTGAC TGGGAAAACC 
TGCAGCACTG ACCCTTTTGG 


CTGGCGTTAC 
GACCGCAATG 
LacZ 


CCAACTTAAT. 
GGTTGAATTA 


CGCCTTGCAG 
GCGGAACGTC 


2351' 


CACATCCCCC TTTCGCCAGC TGGCGTAATA GCGAAGAGGC CCGCACCGAT 
GTGTAGGGGG AAAGCGGTCG ACCGCATTAT CGCTTCTCCG GGCGTGGCTA 

LacZ 


2401 


CGCCCTTCCC AACAGTTGCG 
GCGGGAAGGG TTGTCAACGC 


CAGCCTGAAT 
GTCGGACTTA 
LacZ 


GGCGAATGGC 
CCGCTTACCiG 


GCTTTGCCTG 
CGAAACGGAC 


2451 


. GTTTCCGGCA CCAGAAGCGG 
' CAAAGGCCGT GGTCTTCGCC 


TGCCGGAAAG 
ACGGCCTTTC 
LacZ 


CTGGCTGGAG 
GACCGACCTC 


TGCGATCTTC 
ACGCTAGAAG 


•2501 


CTGAGGCCGA TACTGTCGTC 
GACTCCGGCT ATGACAGCAG 


GTCCCCTCAA 
CAGGGGAGTT 
LacZ 


ACTGGCAGAT 
TGACCGTCTA 


GCACGGTTAC 
CGTGCCAATG 


2551 


GATGCGCCCA TCTACACCAA 
CTACGCGGGT. AGATGTGGTT 


CGTGACCTAT 
GCACTGGATA 
LacZ. 


CCCATTACGG 
GGGTAATGCC 


TCAATCGGCC 
AGTTAGGCGG 


2601 


GTXTGTTCCC ACGGAGAATC 
CAAACT^AGGG TGCCTCTTAG 


CGACGGGTTG 
GCTGCCCAAC 
LacZ 


TTACTCGCTC 
AATGAGCGAG 


ACATTTAATG 
TGTAAATTAC 



2651 TTGATGAAAG CTGGCTACAG GAAGGCCAGA CGCGAATTAT TTTTGATGGC 



AACTACT.TTC GACCGATGTC CTTCCGGTCT GCGCTTAATA AAAACTACCG 
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LacZ 



5 


2701 


GTTAACTCGG 
CAATTGAGCC 


CGTTTCATCT 
GCAAAGTAGA 


GTGGTGCAAC GGGCGCTGGG TCGGTTACGG 
CACCACGTTG CCCGCGACCC AGCCAATGCC 
LacZ 


10 


2751 


CCAGGACAGT 
GGTCCTGTCA 


CGTTTGCCGT 
GCAAACGGCA 


CTGAATTTGA CCTGAGCGCA TTTTTACGCG 
GACTTAAACT GGACTCGCGT AAAAATGCGC 
LacZ 


2801 


CCGGAGAAAA CCGCCTCGCG GTGATGGTGC TGCGCTGGAG TGACGGCAGT 
GGCCTCTTTT GGCGGAGCGC CACTACCACG ACGCGACCTC ACTGCCGTCA 

■ LacZ 


15 


28 51 


TATCTGGAAG 
ATAGACCTTC 


ATCAGGATAT 
TAGTCCTATA 


GTGGCGGATG AGCGGCATTT TCCGTGACGT 
CACCGCCTAC TCGCCGTAAA AGGCACTGCA 
LacZ 


20 


2901 


CTCGTTGCTG 
GAGCAACGAC 


CATAAACCGA 
GTATTTGGCT 


CTACACAAAT CAGCGATTTC CATGTTGCCA 
GATGTGTTTA GTCGCTAAAG ' GTACAACGGT 
LacZ 


25 


2951 


CTCGCTTTAA 
GAGCGAAATT 


TGATGATTTC 
ACTACTAAAG 


AGCCGCGCTG TACTGGAGGC TGAAGTTCAG 
TCGGCGCGAC ATGACCTCCG ACTTCAAGTC 
LacZ 


30 


3001 


ATGTGCGGCG 
TACACGCCGC 


AGTTGCGTGA -CTACCTACGG GTAACAGTTT- CTTTATGGCA 
TCAACGCACT GATGGATGCC CATTGTCAAA GAAATACCGT 
LacZ 


3051 


■ GGG TGAAACG 
CCCACTTTGC 


CAGGTCGCCA GCGGCACCGC GCCTTTCGGC GGTGAAATTA 
GTCCAGCGGT CGCCGTGGCG CGGAAAGCCG. CCACTTTAAT 
LacZ 


35 


3101 


TCGATGAGCG 
AGCTACTCGC 


TGGTGGTTAT 
ACCACCAATA 


GCCGATCGCG TCACACTACG TCTGAACGTC 
CGGCTAGCGC AGTGTGATG'C AGACTTGCAG 
LacZ 


40 


3151 


. GAAAACCCGA 
CTTTTGGGCT 


AACTGTGGAG 
TTGACACCTC 


CGCCGAAATC CCGAATCTCT ATCGTGCGGT 
GCGGCTTTAG GGCTTAGAGA TAGCACGCCA 
LacZ 


45 


3201 


GGTTGAACTG 
CCAACTTGAC 


CACACCGCCG 
GTGTGGCGGC 


ACGGCACGGT GATTGAAGCA GAAGCCTGCG 
TGCCGTGCGA CTAACTTCGT CTTCGGACGC 
LacZ 


50 


3251 


ATGTCGGTTT 
TACAGCCAAA. 


CCGCGAGGTG 
GGCGCTCCAC 


CGGATTGAAA ATGGTCTGCT GCTGCTGAAC 
GCCTAACTTT TACCAGACGA CGACGACTTG 
LacZ. 


3301 


GGCAAGCCGT' TGCTGATTCG 
CCGTTCGGCA ACGACTAAGC 


AGGCGTTAAC CGTCACGAGC ATCATCCTCT 
TCCGCAATTG GCAGTGCTCG TAGTAGGAGA 
LacZ 



55 3351 GCATGGTCAG GTCATGGATG AGCAGACGAT GGTGCAGGAT ATCCTGCTGA 



CGTACCAGTC CAGTACCTAC TCGTCTGCTA CCACGTCCTA TAGGACGACT 
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LacZ 



5 


3401 


TGAAGCAGAA 
ACTTCGTCTT 


CAACTTTAAC 
GTTGAAATTG 


GCCGTGCGCT GTTCGCATTA 
CGGCACGCGA CAAGCGTAAT 
LacZ 


TCCGAACCAT 
AGGCTTGGTA 


10 


3451 


CCGCTGTGGT 
GGCGACACCA 


ACACGCTGTG 
TGTGCGACAC 


CGACCGCTAC GGCCTGTATG 
GCTGGCGATG CCGGACATAC 
LacZ 


TGGTGGATGA 
ACCACCTACT 


3501 


AGCCAATATT 
TCGGTTATAA 


GAAACCCACG 
CTTTGGGTGC 


GCATGGTGCC 
CGTACCACGG 
LacZ 


AATGAATCGT 
TTACTTAGCA 


CTGACCGATG 
GACTGGCTAC 


15 


3551 


ATCCGCGCTG 
TAGGCGCGAC 


GCTACCGGCG 
CGATGGCCGC 


ATGAGCGAAC 
TACTCGCTTG 
LacZ 


GCGTAACGCG AATGG.TGCAG 
CGCATTGCGC TTACCACGTC 


20 


3601 


CGCGATCGTA 
GCGCTAGCAT 


ATCACCCGAG 
TAGTGGGCTC 


TGTGATCATC 
ACACTAGTAG 
LacZ 


TGGTCGCTGG 
ACCAGCGACC 


GGAATGAATC 
CCTTACTTAG 


25 


3651 


AGGCCACGGC 
TCCGGTGCCG 


GCTAATCACG 
CGATTAGTGC 


ACGCGCTGTA TCGCTGGATC 
TGCGCGACAT AGCGACCTAG 
LacZ 


AAATCTGTCG 
TTTAGACAGC 


30 


3701 


ATCCTTCCCG 
TAGGAAGGGC 


CCCGGTGCAG 
GGGCCACGTC 


TATGAAGGCG 
ATACTTCCGC 
LacZ 


GCGGAGCCGA 
CGCCTCGGCT 


CACCACGGCC 
GTGGTGCCGG 


3751 


ACCGATATTA 
TGGCTATAAT 


TTTGCCCGAT 
AAACGGGCTA 


GTACGCGCGC 
CATGCGCGCG 
LacZ 


GTGGATGAAG 
CACCTACTTC 


ACCAGCCCTT 
TGGTCGGGAA 


35 


3801 


CCCGGCTGTG 
GGGCCGACAC 


CCGAAATGGT 
GGCTTTACCA 


CCATCAAAAA 
GGTAGTTTTT 
LacZ 


ATGGCTTTCG 
TACCGAAAGC 


CTACCTGGAG 
GATGGACCTC 


40 


3851 


AGACGCGCCC 
TCTGCGCGGG 


GCTGATCCTT 
CGACTAGGAA 


TGCGAATACG 
ACGCTTATGC 
LacZ 


CCCACGCGAT 
GGGTGCGCTA 


GGGTAACAGT. 
CCCATTGTCA 


45 


3901 


CTTGGCGGTT TCGCTAAATA CTGGCAGGCG TTTCGTCAGJ ATCCCCGTTT 
GAACCGCCAA AGCGATTTAT GACCGTCCGC AAAGCAGTCA TAGGGGCAAA 

LacZ 


50 


3951 


ACAGGGCGGC TTCGTCTGGG 
TGTCCCGCCG AAGCAGACCC 


ACTGGGTGGA 
TGACCCACCT 
LacZ 


TCAGTCGCTG ATTAAATATG 
AGTCAGCGAC TAATTTATAC 


4001 


ATGAAAACGG 
TACTTTTGCC 


CAACCCGTGG 
GTTGGGCACC 


TCGGCTTACG 
AGCCGAATGC 
LacZ 


GCGGTGATTT TGGCGATACG 
CGCCACTAAA ACCGCTATGC 


55 


4051 


CCGAACGATC 
GGCTTGCTAG 


GCCAGTTCTG 
CGGTCAAGAC 


TATGAACGGT 
ATACTTGCCA 


CTGGTCTTTG 
GACCAGAAAC 


CCGACCGCAC 
GGCTGGCGTG 
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LacZ 



5 


4101 


GCCGCATCCA GCGCTGACGG AAGCAAAACA CCA<?CAGCAG TTTTTCCAGT 
CGGCGTAGGT CGCGACTGCC TTCGTTTTGT GGTCGTCGTC AAAAAGGTCA 

LacZ 


.10 


4151 


TCCGTTTATC 
AGGCAAATAG 


CGGGCAAACC 
GCCCGTTTGG 


ATCGAAGTGA 
TAGCTTCACT 
LacZ 


CCAGCGAATA CCTGTTCCGT 
GGTCGCTTAT GGACAAGGCA 


■ 4201 


CATAGCGATA ACGAGCTCCT 
GTATCGCTAT TGCTCGAGGA 


GCACTGGATG 
CGTGACCTAG 
LacZ 


GTGGCGCTGG 
CACCGCGACC 


ATGGTAAGCC 
TACCATTCGG 


15 


4251 


GCTGGCAAGC 
•CGACCGTTCG 


GGTGAAGTGC 
CCACTTCACG 


CTCTGGATGT 
GAGACCTACA 
- LacZ 


CGCTCCACAA 
GCGAGGTGTT 


GGTAAACAGT 
CCATTTGTCA 


20 


4301 


TGATTGAACT GCCTGAACTA CCGCAGCCGG 
ACTAACTTGA CGGACTTGAT GGCGTCGGCC 

LacZ 


AGAGCGCGGG 
TCTCGCGGCC 


GCAACTCTGG 
CGTTGAGACC 


.25 


4351 


CTCACAGTAC 
' GAGTGTCATG 


GCGTAGTGCA 
CGCATCACGT 


ACCGAACGCG 
TGGCTTGCGC 
LacZ 


ACCGCATGGT CAGAAGCCGG 
TGGCGTACCA GTCTTCGGCC 


30 


4401 


GCACATCAGC 
CGTGTAGTCG 


GCCTGGCAGC 
CGGACCGTCG 


AGTGGCGTCT 
TCACCGCAGA 
LacZ 


GGCGGAAAAC 
CCGCCTTTTG 


CTCAGTGTGA 
GAGTCACACT 


4451 


• CGCTCCCCGC 
GCGAGGGGCG 


CGCGTCCCAC 
GCGCAGGGTG 


GCCATCCCGC 
CGGTAGGGCG 
LacZ 


ATCTGACCAC CAGCGAAATG 
TAGACTGGTG GTCGCTTTAC 


35 


4501 


GATTTTTGCA 
CTAAAAACGT 


TCGAGCTGGG 
AGCTCGACCC 


TAATAAGCGT 
ATTATTCGCA 
LacZ 


TGGCAATTTA 
ACCGTTAAAT 


ACCGCCAGTC 
TGGCGGTCAG 


40 


4551 


AGGCTTTCTT 
TCCGAAAGAA 


TCACAGATGT 
AGTGTCTACA 


GGATTGGCGA 
CCTAACCGCT . 
LacZ 


TAAAAAACAA 
ATTTTTTGTT 


CTGCTGACGC 
GACGACTGCG 


45 


4601- 


CGCTGCGCGA" 
GCGACGCGCT 


TCAGTTCACC 
AGTCAAGTGG 


CGTGCACCGC 
GCACGTGGCG 
LacZ 


TGGATAACGA 
ACCTATTGCT 


CATTGGCGTA 
GTAACCGCAT 


50 


4651 


AGTGAAGCGA 
TCACTTCGCT. 


CCCGCATTGA 
GGGCGTAACT 


CCCTAACGCC 
GGGATTGCGG 
LacZ. 


TGGGTCGAAC GCTGGAAGGC 
ACCCAGCTTG CGACCTTCCG 


4701 


GGCGGGCCAT' TACCAGGCCG 
CCGCCCGGTA ATGGTCCGGC 


AAGCAGCGTT 
TTCGTCGCAA 
LacZ 


GTTGCAGTGC 
CAACGTCACG 


ACGGCAGATA 
TGCCGTCTAT 



55 4751 CACTTGCTGA TGCGGTGCTG ATTACGACCG CTCACGCGTG GCAGCATCAG 



GTGAACGACT ACGCCACGAC TAATGCTGGC GAGTGCGCAC CGTCGTAGTC 
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LacZ 



5 


4801 


GGGAAAACCT 
CCCTTTTGGA 


TATTTATCAG 
ATAAATAGTC 


- CCGGAAAACC 
GGCCTTTTGG 
LacZ 


TACCGGATTG 
ATGGCCTAAC 


ATGGTAGTGG 
TACCATCACC 


10 


4851 


TCAAATGGCG 
AGTTTACCGC 


ATTACCGTTG ATGTTGAAGT GGCGAGCGAT ACACCGCATC 
TAATGGCAAC TACAACTTCA CCGCTCGCTA TGTGGCGTAG 
LacZ 


4901 


. CGGCGCGGAT TGGCCTGAAC TGCCAGCTGG CGCAGGTAGC AGAGCGGGTA 
GCCGCGCCTA ACCGGACTTG ACGGTCGACC GCGTCCATCG TCTCGCCCAT 

LacZ 


15 


4951 ' 


AACTGGCTCG 
TTGACCGAGC 


GATTAGGGCC 
CTAATCCCGG 


•GCAAGAAAAC 
CGTTCTTTTG 
LacZ 


TATCCCGACC 
ATAGGGCTGG 


GCCTTACTGC 
CGGAATGACG 


20 . 


5001 


CGCCTGTTTT 
GCGGACAAAA 


GACCGCTGGG 
CTGGCGACCC 


ATCTGCCATT. GTCAGACATG 
TAGACGGTAA CAGTCTGTAC 
LaeZ 


TATACCCCGT 
ATATGGGGCA 


25 


5051 


ACGTCTTCCC 
TGCAGAAGGG 


GAGCGAAAAC 
CTCGCTTTTG 


GGTCTGCGCT 
CCAGACGCGA 
.LacZ 


GCGGGACGCG 
CGCCCTGCGC 


CGAATTGAAT 
GCTTAACTTA 


30 


5101 


TATGGCCCAC 
ATACCGGGTG 


ACCAGTGGCG CGGCGACTTC 
TGGTCACCGC GCCGCTGAAG 
LacZ 


CAGTTCAACA 
GTCAAGTTGT 


TCAGCCGGTA 
AGTCGGCCAT 


D1D1 


CAGTCAACAG 
GTCAGTTGTC 


CAATTGATGG 
GTTAACTACC 


AAACCAGCCA 
TTTGGTCGGT 
LacZ 


TTCGCCATCT 
AAGCGGTAGA 


GCTGCACGCG 
CGACGTGCGC 


35 


5201 1 


GAAGAGGCAC 
CTTCTCCGTG 


ATGGCTGAAT 
TACCGACTTA 


ATCGACGGTT TCCATATGGG 
TAGCTGCCAA AGGTATACCC 
LacZ 


GATTGGTGGC 
CTAACGACCG 


40 


5251 


GACGACTCCT 
CTGCTGAGGA 


GGAGCCCGTC 
CCTCGGGCAG 
. LacZ 


AGTATCGGCG 
TCATAGCCGC 


GAATTCCAGC 
CTTAAGGTCG 


TGAGCGCCGG. 
ACTCGCGGCC 



45 



50 



5301 
5351 

5401 



TCGCTACCAT TACCAGTTGG TCTGGTGTGA A7VAATAATAA TAACCGGGCA 
AGCGATGGTA ATGGTCAACC AGACCACAGT TTTTATTATT ATTGGCCCGT 
GGGGGGATCC GGAGCTTATC GCAGATCAAT TCGATATCAA GCTTATCGAT 
CCCCCCTAGG CCTCGAATAG CGTCTAGTTA AGCTATAGTT CGAATAGCTA 

.H6 Promoter 



ACCGTCGACC TCGAGTCTAG AATCGATCCC GGGTTCTTTA TTCTATACTT 
TGGCAGCTGG AGCTCAGATC TTAGCTAGGG CCCAAGAAAT AAGATATGAA 
H6 Promoter 



55 



5451 AAAAAGTGAA AATAAATACA AAGGTTCTTG AGGGTTGTGT TAAATTGAAA 
TTTTTCACTT TTATTTATGT TTCCAAGAAC TCCCAACACA ATTTAACTTT 
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H6 Promoter 



5 




oUwvaAAAlrt A1UA1AAA11 Al l ICAxTAl ^oCv?ATATCC GTTAAGTTTG 
CGCTCTTTAT TAGTATTTAA TAAAGTAATA GCGCTATAGG CAATTCAAAC 
H6 Promoter gplOO (M) 


.10 




lAlUtalAAlU 1GCAGCCCCC ACCATGGATC TGGTGCTAAA AAGATGCCTT 
ATAGCATTAG ACGTCGGGGG TGGTACCTAG ACCACGATTT TTCTACGGAA 

gpl00{M) 




uiiUAiiiWa CTGTGATAGG TGCTTTGCTG GCTGTGGGGG CTACAAAAGT 
GAAGTAAACC GACACTATCC ACGAAACGAC CGACACCCCC GATGTTTTCA 

gpl00(M) 


1 c 
13 


DOD1 


AUy UAb AAAU CAGGACTGGC TTGGTGTCTC AAGGCAACTC AGAACCAAAG 
TGGGTCTTTG GTCCTGACCG AACCACAGAG TTCCGTTGAG TCTTGGTTTC 

gpl00(M) 


20 


C-7A1 


CCTGGAACAG GCAGCTGTAT CCAGAGTGGA CAGAAGCCCA GAGACTTGAC 
GGACCTTGTC CGTCGACATA GGTCTCACCT GTCTTCGGGT' CTCTGAACTG 

gplOO(M) 


25 


D t 01 


TGCTGGAGAG GTGGTCAAGT GTCCCTCAAG GTCAGTAATG ATGGGCCTAC 
ACGACCTCTC CACCAGTTCA CAGGGAGTTC CAGTCATTAC TACCCGGATG 

gpl00(M) 


30 


JOU1 


A^lbATTGGT GCAAATGCCT CCTTCTCTAT TGCCTTGAAC TTCCCTGGAA 
TGACTAACCA CGTTTACGGA GGAAGAGATA ACGGAACTTG AAGGGACCTT 

gpl00(M) 


Do 01 


GCCAAAAGGT ATTGCCAGAT GGGCAGGTTA TCTGGGTCAA CAATACCATC 
CGGTTTTCCA TAACGGTCTA CCCGTCCAAT AGACCCAGTT GTTATGGTAG 

gpl00(M) 






AiuAAlUbGA GCXAGGTGTG GGGAGGACAG CCAGTGTATC CCCAGGAAAC 
TAGTTACCCT CGGTCCACAC CCCTCCTGTC GGTCACATAG GGGTCCTTTG 

gplOO.(M) 


40 


5951 


. iv3AuvaAibL,U lvoUAlLllOC CTUATGGTGG ACCTTGCCCA TCTGGCTCTT 
ACTGCTACGG ACGTAGAAGG. GACTACCACC TGGAACGGGT AGACCGAGAA 

gpl00(M) 


45 


6001 


GGTPTPAHAA fiAnAA^PTTT fiTTTlTCTPT ClC.T± nChrTrvr* nnnr*m\ -n m-A /~> 
1 v i wiunn o-tt>artrVO^ ill ijlilttlvjl^l oVaAAoA^V., 1 Vjj oCjvjUCAATAC 

CCAGAGTCTT CTCTTCGAAA CAAATACAGA CCTTCTGGAC CCCGGTTATG 

gplOO'(M) 


50 


6051 


TGGCAAGTTC TAGGGGGCCC AGTGTCTGGG CTGAGCATTG GGACAGGCAG 
ACCGTTCAAG ATCCCCCGGG TCACAGACCC GACTCGTAAC CCTGTCCGTC 

gpl00{M) 


6101 


GGCAATGCTG - GGCACACACA CGATGGAAGT GACTGTCTAC CATCGCCGGG ' 
CCGTTACGAC CCGTGTGTGT GCTACCTTCA CTGACAGATG GTAGCGGCCC 

gpl00(M) 



55 6151 GATCCCGGAG CTATGTGCCT CTTGCTCATT CCAGCTCAGC CTTCACCATT 



CTAGGGCCTC GATACACGGA GAACGAGTAA GGTCGAGTCG GAAGTGGTAA 
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gpIOO(M) 



5 


6201 


ATGGACCAGG 
TACCTGGTCC 


TGCCTTTCTC CGTGAGCGTG TCCCAGTTGC GGGCCTTGGA 
ACGGAAAGAG GCACTCGCAC AGGGTCAACG CCCGGAACCT 
gplOO(M) 


10 


6251 


TGGAGGGAAC 
ACCTCCCTTG 


AAGCACTTCC TGAGAAATCA GCCTCTGACC TTTGCCCTCC 
TTCGTGAAGG ACTCTTTAGT CGGAGACTGG AAACGGGAGG 
gplOO(M) 


6301 


. AGCTCCATGA 
TCGAGGTACT 


CCCCAGTGGC TATCTGGCTG 
GGGGTCACCG ATAGACCGAC 
gplOO (M) 


AAGCTGACCT CTCCTACACC 
TTCGACTGGA GAGGATGTGG 


15 


6351 


TGGGACTTTG 
ACCCTGAAAC 


GAGACAGTAG TGGAACCCTG 
CTCTGTCATC ACCTTGGGAC 
gplOO(M) 


ATCTCTCGGG 
TAGAGAGCCC 


CACTTGTGGT 
GTGAACACCA 


20 


64.01 


CACTCATACT 
GTGAGTATGA 


TACCTGGAGC CTGGCCCAGT 
ATGGACCTCG GACCGGGTCA 
gplOO(M) 


CACTGTTCAG 
GTGACAAGTC 


GTGGTCCTGC 
CACCAGGACG 


25 


6451 


AGGCTGCCAT 
TCCGACGGTA 


TCCTGTCACC TCCTGTGGCT 
AGGAGAGTGG AGGACACCGA 
gplOO(M) 


CCTCCCCAGT 
GGAGGGGTCA 


TCCAGGCACG 
AGGTCCGTGG 


30 


6501 


ACAGATGGGC 
TGTCTACCCG 


ACAGGCCAAC TGCAGAGGCC 
TGTCCGGTTG ACGTCTCCGG 
gplOO (M) 


CCTAACACCA 
GGATTGTGGT 


CAGCTGGCCA 
GTCGACCGGT 


6551 


AGTGCCTACT 
TCACGGATGA 


ACAGAAGTTG TGGGTACTAC 
TGTCTTCAAC ACCCATGATG 
gplOO(M) 


ACCTGGTCAG 
TGGACCAGTC 


GCGCCAACTG 
CGCGGTTGAC 


35 


6601 ' 


CAGAGCCCTC 
GTCTCGGGAG 


TGGAACCACA TCTGTGCAGG 
ACCTTGGTGT AGACACGTCC 
gplOO(M) 


TGCCAACCAC 
ACGGTTGGTG 


TGAAGTCATA 
ACTTCAGTAT 


40 


6651 


AGCACTGCAC 
TCGTGACGTG 


CTGTGCAGAT GCCAACTGCA 
GACACGTCTA CGGTTGACGT 
gplOO(M) 


GAGAGCACAG 
CTCTCGTGTC 


GTATGACACC. 
CATACTGTGG 


45 


6701 


TGAGAAGGTG 
ACTCTTCCAC 


CCAGTTTCAG AGGTCATGGG 
GGTCAAAGTC TCCAGTACCC 
gplOO(M) 


TACCACACTG 
ATGGTGTGAC 


GCAGAGATGT 
CGTCTCTACA 


50 


- 6751 


CAACTCCAGA 
GTTGAGGTCT 


GGCTACAGGT ATGACACCTG 
CCGATGTCCA TACTGTGGAC 
gplOO(M) 


CAGAGGTATC 
GTCTCCATAG 


AATTGTGGTG 
TTAACACCAC 


6801 


CTTTCTGGAA 
GAAAGACCTT 


CCACAGCTGC ACAGGTAACA 
GGTGTCGACG TGTCCATTGT 
.gplOO(M) 


ACTACAGAGT 
TGATGTCTCA 


GGGTGGAGAC 
CCCACCTCTG 



55' 6851 CACAGCTAGA GAGCTACCTA TCCCTGAGCC TGAAGGTCCA GATGCCAGCT 



GTGTCGATCT CTCGATGGAT AGGGACTCGG ACTTCCAGGT CTACGGTCGA 
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gplOO(M) 



5 


6901 


CAATCATGTC 
GTTAGTACAG 


TACGGAAAGT ATTACAGGTT CCCTGGGCCC CCTGCTGGAT 
ATGCCTTTCA TAATGTCCAA GGGACCCGGG GGACGACCTA 
gplOO(M) 


10 


6951 


GGTACAGCCA 
CCATGTCGGT 


CCTTAAGGCT GGTGAAGAGA CAAGTCCCCC TGGATTGTGT 
GGAATTCCGA CCACTTCTCT GTTCAGGGGG ACCTAACACA 
gplOO(M) 


7001 


TCTGTATCGA 
AGACATAGCT 


TATGGTTCCT TTTCCGTCAC CCTGGACATT GTCCAGGGTA 
ATACCAAGGA AAAGGCAGTG GGACCTGTAA CAGGTCCCAT 
gplOO(M) 


15 


7051 


TTGAAAGTGC 
AACTTTCACG 


CGAGATCCTG CAGGCTGTGC CGTCCGGTGA GGGGGATGCA 
GCTCTAGGAC GTCCGACACG GCAGGCCACT CCCCCTACGT 
gplOO(M) 


20 . 


7101 


TTTGAGCTGA 
AAACTCGACT 


CTGTGTCCTG CCAAGGCGGG CTGCCCAAGG AAGCCTGCAT 
GACACAGGAC GGTTCCGCCC GACGGGTTCC TTCGGACGTA 
gplOO(M) 


25 


7151 


GGAGATCTCA 
CCTCTAGAGT 


TCGCCAGGGT GCCAGCCCCC TGCCCAGCGG CTGTGCCAGC 
AGCGGTCCCA CGGTCGGGGG ACGGGTCGCC GACACGGTCG 
gplOO(M) 


30 


7201 


CTGTGCTACC 
GACACGATGG 


CAGCCCAGCC TGCCAGCTGG TTCTGCACCA GATACTGAAG 
GTCGGGTCGG ACGGTCGACC AAGACGTGGT CTATGACTTC 
gplOO(M) 


7251 


GGTGGCTCGG 
CCACCGAGCC 


GGACATACTG CCTCAATGTG TCTCTGGCTG ATACCAACAG 
CCTGTATGAC GGAGTTACAC AGAGACCGAC TATGGTTGTC 
gplOO(M) 


35 


■ 7301 


CCTGGCAGTG 
GGACCGTCAC 


GTCAGCACCC AG'CTTATCAT GCCTGGTCAA GAAGCAGGCC 
CAGTCGTGGG TCGAATAGTA CGGACCAGTT CTTCGTCCGG 
gplOO(M) 


40 


7351 


TTGGGCAGGT 
AACCCGTCCA 


TCCGCTGATC GTGGGCATCT TGCTGGTGTT GATGGCTGTG. 
AGGCGACTAG CACCCGTAGA ACGACCACAA CTACCGACAC 
gplOO(M) 


45 


7401 


GTCCTTGCAT CTCTGATATA TAGGCGCAGA CTTATGAAGG AAGACTTCTC 
. CAGGAACGTA GAGACTATAT ATCCGCGTCT GAATACTTCG TTCTGAAGAG 

gplOO(M) 


50 


7451 


CGTACCCCAG 
GCATGGGGTC 


TTGCCACATA GCAGCAGTCA CTGGCTGCGT CTACCCCGCA 
AACGGTGTAT CGTCGTCAGT GACCGACGCA GATGGGGCGT 
gplOO(M) 


7501 


TCTTCTGCTC 
AGAAGACGAG 
gplOO(M) ■ 


TTGTCCCATT GGTGAGAACA GCCCCCTCCT CAGTGGGCAG 
AACAGGGTAA CCACTCTTGT CGGGGGAGGA GTCACCCGTC 

42K promoter 


55 


7551 


CAGGTCTGAT 
GTCCAGACTA 


TTTTATTCTA GTTCAAAAAA ATATAAATGA TTCACCATCT 
AAAATAAGAT CAAGTTTTTT TATATTTACT AAGTGGTAGA 
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42K promoter 



5 


7601 


GATAGAAAAA AAATTTATTG 
CTATCTTTTT TTTAAATAAC 
42K promoter 


GGAGAATATG 
CCTCTTATAC 


ATAATATTTT 
TATTATAAAA 


GGGATTTCAA 
CCCTAAAGTT 
Mart-1 


10 


7651 


AATTGAAAAT 
TTAACTTTTA 


ATATAATTAC 
TATATTAATG 


AATATAAATC 
TTATATTTAG 
Mart-1 


TAGACCACCA 
ATCTGGTGGT 


TGCCAAGAGA 
ACGGTTCTCT 


7701 


AGATGCTCAC 
TCTACGAGTG 


TTCATCTATG 
AAGTAGATAC 


GTTACCCCAA 
CAATGGGGTT 
Mart-1 


GAAGGGGCAC 
CTTCCCCGTG 


GGCCACTCTT 
CCGGTGAGAA 


15 


7751 


ACACCACGGC 
TGTGGTGCCG 


TGAAGAGGCC 
ACTTCTCCGG 


GCTGGGATCG 
CGACCCTAGC 
Mart-1 


GCATCCTGAC 
CGTAGGACTG 


AGTGATCCTG 
TCACTAGGAC 


20 


7801 


GGAGTCTTAC 
CCTCAGAATG 


TGCTCATGGG..- CTGTTGGTAT 
ACGAGTAGCC GACAACCATA 
Mart-1 


TGTAGAAGAC 
ACATCTTCTG 


GAAATGGATA 
' CTTTACCTAT 


25 


7851 


CAGAGCCTTG ATGGATAAAA GTCTTCATGT 
GTCTCGGAAC TACCTATTTT CAGAAGTACA 

Mart-1 


TGGCACTCAA 
ACCGTGAGTT 


TGTGCCTTAA 
ACACGGAATT 


30 


7901 


CAAGAAGATG 
GTTCTTCTAC 


CCCACAAGAA 
GGGTGTTCTT 


GGGTTTGATC 
CCCAAACTAG 
Mart-1 


ATCGGGACAG 
TAGCCCTGTC 


CAAAGTGTCT 
GTTTCACAGA 


7951 


CTTCAAGAGA 
GAAGTTCTCT 


AAAACTGTGA ACCTGTGGTT 
TTTTGACACT TGGACACCAA 
Mart-1 


CCCAATGCTC 
GGGTTACGAG 


CACCTGCTTA 
GTGGACGAAT 


35 


8001 


TGAGAAACTC 
ACTCTTTGAG 


TCTGCAGAAC 
AGACGTCTTG 


AGTCACCACC ACCTTATTCA 
TCAGTGGTGG TGGAATAAGT 
sE/L Promoter 


CCTTAATCTA 
GGAATTAGAT 


40 


8051 


GAGTCGACCT GCAGGCATGC 
CTCAGCTGGA CGTCCGTACG 
sE/L Promoter 


AAAAATTGAA 
TTTTTAACTT 


ATTTTATTTT 
TAAAATAAAA 


TTTTTTTTGG 
AAAAAAAACC 










Mage 1-3 minigene 




45 


8101 


AATATAAATA 
TTATATTTAT 


ATGGAGTCCT TGCAGCTGGT CTTTGGCATT 
TACCTCAGGA ACGTCGACCA GAAACCGTAA 
Mage 1-3 minigene 


GACGTGAAGG 
CTGCACTTCC 


50 


8151 


AAGCAGACCC 
TTCGTCTGGG 


CACCGGCCAC 
GTGGCCGGTG 
Mage 1 


TCCTATGTCC 
AGGATACAGG 
.-3 minigene 


TTGTCACCTG 
AACAGTGGAC 

j 


CCTAGGTCTC 
GGATCCAGAG 




.8201 


TCCTATGATG 
AGGATACTAC 


GCAATAAGCG 
CGTTATTCGC 


TAAAGAAGTG 
ATTTCTTCAC 


GACCCCATCG 
CTGGGGTAGC 


GCCACTTGTA 
CGGTGAACAT 



55 
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Mage 1-3 minigene 



C5 Left Arm 



8251 



8301 



8351 



8401' 



8451 ' 



8501 



8551 



8601 



8651 



8701 



8751 



8801 



8851 



8901 



8951 



55 * 9001 



10 



15 



20 



25 



30 



35 



40 



45 



50 



CTAGTTTTTA 
GATCAAAAAT 


TCCCGGGTTT • TTATGACTAG TTAATCACGG CCGCTTATAA 
AGGGCCCAAA AATACTGATC AATTAGTGCC GGCGAATATT 
C5 Left Arm 


AGATCTAAAA 
TCTAGATTTT 


TGCATAATTT 
ACGTATTAAA 
C5 


CTAAATAATG AAAAAAAAGT ACATCATGAG 
GATTTATTAC TTTTTTTTCA TGTAGTACTC 
Left Arm 


CAACGCGTTA 
GTTGCGCAAT 


GTATATTTTA 
CATATAAAAT 
C5 


CAATGGAGAT TAACGCTCTA TACCGTTCTA 
GTTACCTCTA ATTGCGAGAT ATGGCAAGAT 
Left Arm 


TGTTTATTGA 
ACAAATAACT 


TTCAGATGAT 
AAGTCTACTA 
C5 


GTTTTAGAAA AGAAAGTTAT TGAATATGAA 
CAAAATCTTT TCTTTCAATA AQTTATACTT 
Left Arm 


AACTTTAATG .AAGATGAAGA TGACGACGAT GATTATTGTT GTAAATCTGT 
TTGAAATTAC TTCTACTTCT ACTGCTGCTA CTAATAACAA CATTTAGACA 
C5 Left Arm 


TTTAGATGAA 
AAATCTACTT 


GAAGATGACG 
CTTCTACTGC 
C5 


CGCTAAAGTA TACTATGGTT ACAAAGTATA 
GCGATTTCAT ATGATACCAA TGTTTCATAT 
Left Arm 


AGTCTATACT ACTAATGGCG 
TCAGATATGA TGATTACCGC 
C5 


ACTTGTGCAA GAAGGTATAG TATAGTGAAA 
TGAACACGTT CTTCCATATC ATATCACTTT 
Left Arm 


ATGTTGTTAG 
TACAACAATC 


ATTATGATTA 
TAATACTAAT 
C5 


TGAAAAACCA AATAAATCAG ATCCATATCT 
ACTTTTTGGT TTATTTAGTC TAGGTATAGA 
Left Arm 


AAAGGTATCT CCTTTGCACA TAATTTCATC TATTCCTAGT TTAGAATACT 
TTTCCATAGA GGAAACGTGT ATTAAAGTAG ATAAGGATCA AATCTTATGA 
C5 Left Arm 


TTTCATTATA 
AAAGTAATAT 


TTTGTTTACA 
AAACAAATGT 
■ C5 


GCTGAAGACG AAAAAAATAT ATCGATAATA. 
CGACTTCTGC TTTTTTTATA TAGCTATTAT 
Left Arm 



GAAGATTATG TTAACTCTGC TAATAAGATG AAATTGAATG AGTCTGTGAC 
CTTCTAATAC AATTGAGACG ATTATTCTAC TTTAACTTAC TCAGACACTG 
C5 Left Arm 



TGCAGCCAAG 
ACGTCGGTTC 
AACCCTGGCG 
TTGGGACCGC 
CAGCTGGCGT 
GTCGACCGCA 
TGCGCAGCCT 
ACGCGTCGGA 
CATCTGTGCG 
GTAGACACGC 



CTTGGCACTG 
GAACCGTGAC 
TTACCCAACT 
AATGGGTTGA- 
AATAGCGAAG 
TTATCGCTTC 
GAATGGCGAA 
CTTACCGCTT 
GTATTTCACA 
CATAAAGTGT 



GCCGTCGTTT 
CGGCAGCAAA 
TAATCGCCTT 
ATTAGCGGAA 
AGGCCCGCAC 
TCCGGGCGTG 
TGGCGCCTGA 
ACCGCGGACT 
CCGCATATGG 
GGCGTATACC 



TACAACGTCG 
ATGTTGCAGC 
GCAGCACATC 
CGTCGTGTAG 
CGATCGCCCT 
GCTAGCGGGA 
TGCGGTATTT 
ACGCCATAAA 
TGCACTCTCA 
ACGTGAGAGT 



TGACTGGGAA 
ACTGACCCTT 
CCCCTTTCGC 
GGGGAAAGCG 
TCCCAACAGT 
AGGGTTGTCA 
TCTCCTTACG 
AGAGGAATGC 
GTACAATCTG 
CATGTTAGAC 
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10 



15 



9051 
9101 
9151 
9201 
9251 
9301 
9351 



CTCTGATGCC GCATAGTTAA GCCAGCCCCG ACACCCGCCA ACACCCGCTG 
GAGACTACGG CGTATCAATT CGGTCGGGGC TGTGGGCGGT TGTGGGCGAC 
ACGCGCCCTG ACGGGCTTGT CTGCTCCCGG CATCCGCTTA CAGACAAGCT 
TGCGCGGGAC TGCCCGAACA GACGAGGGCC GTAGGCGAAT GTCTGTTCGA 
GTGACCGTCT CCGGGAGCTG CATGTGTCAG AGGTTTT.CAC CGTCATCACC 
CACTGGCAGA GGCCCTCGAC GTACACAGTC TCCAAAAGTG GCAGTAGTGG 
GAAACGCGCG AGACGAAAGG GCCTCGTGAT ACGCCTATTT TTATAGGTTA 
CTTTGCGCGC TCTGCTTTCC CGGAGCACTA TGCGGATAAA AATATCCAAT 
ATGTCATGAT AATAATGGTT TCTTAGACGT CAGGTGGCAC TTTTCGGGGA 
TACAGTACTA TTATTACCAA AGAATCTGCA GTCCACCGTG AAAAGCCCCT 
AATGTGCGCG GAACCCCTAT TTGTTTATTT TTCTAAATAC ATTCAAATAT 
TTACACGCGC CTTGGGGATA AACAAATAAA AAGATTTATG TAAGTTTATA 
GTATCCGCTC ATGAGACAAT AACCCTGATA AATGCTTCAA TAATATTGAA 
CATAGGCGAG TACTCTGTTA TTGGGACTAT TTACGAAGTT ATTATAACTT 

Amp(R) . 



20 


9401 


AAAGGAAGAG 
TTTCCTTGTC 


TATGAGTATT 
ATACTCATAA 


CAACATTTCC 
GTTGTAAAGG 
Amp(R) 


GTGTCGCCCT 
CACAGCGGGA 


TATTCCCTTT 
ATAAGGGAAA 


9451 


TTTGCGGCAT 
AAACGCCGTA 


TTTGCCTTCC 
AAACGGAAGG 


TGTTTTTGCT 
ACAAAAACGA 
Arap(R) 


CACCCAGAAA 
GTGGGTCTTT 


CGCTGGTGAA 
GCGACCACTT 


25 


9501 


AGTAAAAGAT GCTGAAGATC AGTTGGGTGC 
TCATTTTCTA CGACTTCTAG TCAACCCACG 

Amp(R), 


ACGAGTGGGT 
TGCTCACCCA 


TACATCGAAC 
ATGTAGCTTG 


30 


OCC1 

9551 


TGGATCTCAA CAGCGGTAAG ATCCTTGAGA GTTTTCGCCC 
ACCTAGAGTT GTCGCCATTC TAGGAACTCT CAAAAGCGGG 

Amp(R) 


CGAAGAACGT 
GCTTCTTGCA 


35 ■ 


9601 


TTTCCAATGA 
AAAGGTTACT 


TGAGCACTTT 
ACTCGTGAAA 


TAAAGTTCTG 
ATTTCAAGAC 
Amp(R) 


CTATGTGGCG 
GATACACCGC 


CGGTATTATC 
GCCATAATAG 


40 


9651 


CCGTATTGAC 
GGCATAACTG 


GCGGGGCAAG 
CGGCCCGTTC 


AGCAACTCGG 
TCGTTGAGCC 
Amp (R) 


TCGCCGCATA 
AGCGGCGTAT 


CACTATTCTC 
GTGATAAGAG 


9701 


AGAATGACTT 
TCTTACTGAA 


GGTTGAGTAC 
CCAACTCATG 


TCACCAGTCA 
AGTGGTCAGT 
Amp (R) - 


CAGAAAAGCA 
GTCTTTTCGT 


TCTTACGGAT 
AGAATGCCTA 


45 


. 9751 


GGCATGACAG TAAGAGAATT ATGCAGTGCT GCCATAACCA TGAGTGATAA 
CCGTACTGTC ATTCTCTTAA TACGTCACGA CGGTATTGGT ACTCACTATT 

Amp(R) 


50 


9801 


CACTGCGGCC 
GTGACGCCGG 


AACTTACTTC 
TTGAATGAAG 


TGACAACGAT 
ACTGTTGCTA 
Amp (R) 


CGGAGGACCG 
GCCTCCTGGC 


AAGGAGCTAA 
TTCCTCGATT 




9851 


CCGCTTTTTT 
GGCGAAAAAA 


GCACAACATG 
CGTGTTGTAC 


GGGGATCATG 
CCCCTAGTAC 


TAACTCGCCT 
ATTGAGCGGA 


TGATCGTTGG 
ACTAGCAACC 
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Amp <R) 



10 



20 



25 



9901 



9951' 



10001 



15 10051' 



10101 



10151 



10201 



30 

10251 
10301 
35 10351 
10401 
10451 

40 

10501 
10551 
45 10601 
10651 
10701 

50 

10751 
10801 
55 .10851 



GAACCGGAGC 
CTTGGCCTCG 


TGAATGAAGC CATACCAAAC 
ACTTACTTCG GTATGGTTTG 
Amp <R) 


GACGAGCGTG ACACCACGAT 
CTGCTCGCAC TGTGGTGCTA 


GCCTGTAGCA 
CGGACATCGT 


ATGGCAACAA 
TACCGTTGTT 


CGTTGCGCAA 
GCAACGCGTT 
Amp(R) 


ACTATTAACT GGCGAACTAC 
TGATAATTGA CCGCTTGATG 


TTACTCTAGC 
AATGAGATCG 


TTCCCGGCAA 
AAGGGCCGTT 


CAATTAATAG 
GTTAATTATC 
Amp (R) 


ACTGGATGGA 
TGACCTACCT 


GGCGGATAAA 
CCGCCTATTT 


GTTGCAGGAC 
CAACGTCCTG 


CACTTCTGCG 
GTGAAGACGC 


CTCGGCCCTT 
GAGCCGGGAA 
Amp.(R) 


CCGGCTGGCT 
GGCCGACCGA 


GGTTTATTGC 
CCAAATAACG 


TGATAAATCT 
ACTATTTAGA 


GGAGCCGGTG 
CCTCGGCCAC 


AGCGTGGGTC 
TCGCACCCAG 
Amp (R) 


TCGCGGTATC 
AGCGCCATAG 


ATTGCAGCAC 
TAACGTCGTG 


TGGGGCCAGA 
ACCCCGGTCT 


TGGTAAGCCC 
ACCATTCGGG 


TCCCGTATCG 
AGGGCATAGC 
Amp<R) 


TAGTTATCTA 
ATCAATAGAT 


CACGACGGGG 
GTGCTGCCCC 


AGTCAGGCAA CTATGGATGA ACGAAATAGA 
TCAGTCCGTT GATACCTACT TGCTTTATCT 
Amp (R) 


CAGATCGCTG 
GTCTAGCGAC 


AGATAGGTGC 
TCTATCCACG 



CTCACTGATT 
GAGTGACTAA 
TTTAGATTGA 
AAATCTAACT 
ATCCTTTTTG 
TAGGAAAAAC 
CCACTGAGCG 
GGTGACTCGC 
CTTTTTTTCT 
GAAAAAAAGA 
■ CCAGCGGTGG 
GGTCGCCACC 
GGTAACTGGC 
CCATTGACCG 
AGCCGTAGTT 
TCGGCATCAA 
CTCGCTCTGC 
GAGCGAGACG 
GTGTCTTACC 
CACAGAATGG 
GGTCGGGCTG 
CCAGCCCGAC 
ACCTACACCG 
TGGATGTGGC 
GCTTCCCGAA 
CGAAGGGCTT 



AAGCATTGGT 
TTCGTAACCA 
TTTAAAACTT 
AAATTTTGAA 
ATAATCTCAT 
TATTAGAGTA 
TCAGACCCCG 
AGTCTGGGGC 
GCGCGTAATC 
CGCGCATTAG 
TTTGTTTGCC 
AAACAAACGG 
TTCAGCAGAG 
AAGTCGTCTC 
AGGCCACCAC 
TCCGGTGGTG 
TAATCCTGTT 
ATTAGGACAA 
GGGTTGGACT 
CCCAACCTGA 
AACGGGGGGT 
TTGCCCCCCA 
AACTGAGATA 
TTGACTCTAT 
GGGAGAAAGG 
CCCTCTTTCC 



AACTGTCAGA 
TTGACAGTCT 
CATTTTTAAT 
GTAAAAATTA 
GACCAAAATC 
CTGGTTTTAG 
TAGAAAAGAT 
ATCTTTTCTA 
TGCTGCTTGC 
ACGACGAACG 
GGATCAAGAG 
CCTAGTTCTC 
CGCAGATACC 
GCGTCTATGG 
TTCAAGAACT 
AAGTTCTTGA 
ACCAGTGGCT 
TGGTCACCGA 
CAAGACGATA 
GTTCTGCTAT 
TCGTGCACAC 
AGCACGTGTG 
CCTACAGCGT 
GGATGTCGCA 
CGGACAGGTA 
GCCTGTCCAT 



CCAAGTTTAC 
GGTTCAAATG 
TTAAAAGGAT 
AATTTTCCTA 
CCTTAACGTG 
GGAATTGCAC 
CAAAGGATCT 
GTTTCCTAGA 
AAACAAAAAA 
TTTGTTTTTT 
CTACCAACTC 
GATGGTTGAG 
AAATACTGTC 
TTTATGACAG 
CTGTAGCACC 
GACATCGTGG 
GCTGCCAGTG 
CGACGGTCAC 
GTTACCGGAT 
CAATGGCCTA 
AGCCCAGCTT 
TCGGGTCGAA 
GAGCTATGAG 
CTCGATACTC 
TCCGGTAAGC 
AGGCCATTCG 



TCATATATAC 

AGTATATATG 

CTAGGTGAAG 

GATCCACTTC 

AGTTTTCGTT 

TCAAAAGCAA 

TCTTGAGATC 

AGAACTCTAG 

ACCACCGCTA. 

TGGTGGCGAT 

TTTTTCCGAA 

AAAAAGGCTT . 

CTTCTAGTGT 

GAAGATCACA 

GCCTACATAC 

CGGATGTATG 

GCGATAAGTC 

CGCTATTCAG 

AAGGGGCAGC 

TTCCGCGTCG 

GGAGCGAACG 

CCTCGCTTGC 

AAAGCGCCAC 

TTTCGCGGTG 

GGCAGGGTCG 

CCGTCCCAGC 
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10 



15 



20 



10901 
10951 
11001 
11051 
11101 
11151 
11201 
11251 
11301 
11351 
11401 
11451 



GAACAGGAGA 
CTTGTCCTCT 
TATAGTCCTG 
ATATCAGGAC 
ATGCTCGTCA 
TACGAGCAGT 
TTTTACGGTT 
AAAATGCCAA 
GCGTTATCCC 
CGCAATAGGG 
TGATACCGCT 
ACTATGGCGA 
AGGAAGCGGA 
TCCTTCGCCT 
CCGATTCATT 
GGCTAAGTAA 
CAGTGAGCGC 
GTCACTCGCG 
AGGCTTTACA 
TCCGAAATGT 
GGATAACAAT 
CCTATTGTTA 
TTGCGGCCGC 



GCGCACGAGG 
CGCGTGCTCC 
TCGGGTTTCG 
AGCCCAAAGC 
GGGGGGCGGA 
CCCCCCGCCT 
CCTGGCCTTT 
GGACCGGAAA 
CTGATTCTGT 
GACTAAGACA 
CGCCGCAGCC 
GCGGCGTCGG 
AGAGCGCCCA 
TCTCGCGGGT 
AATGCAGCTG 
TTACGTCGAC 
AACGCAATTA 
TTGCGTTAAT 
CTTTATGCTT 
GAAATACGAA 
TTCACACAGG 
AAGTGTGTCC 
AATTCAACGC 



GAGCTTQCAG 
CTCGAAGGTC 
CCACCTCTGA 
GGTGGAGACT 
GCCTATGGAA 
CGGATACCTT 
TGCTGGCCTT 
ACGACCGGAA 
GGATAACCGT 
CCTATTGGCA 
GAACGACCGA 
CXTGCTGGCT 
ATACGCAAAC 
TATGCGTTTG 
GCAGGACAGG 
CGTGCTGTCC 
ATGTGAGTTA 
TACACTCAAT 
CCGGCTCGTA 
GGCCGAGCAT 
AAACAGCTAT 
TTTGTCGATA 
CGGCGTTAAG 



GGGGAAACGC 
CCCCTTTGCG 
CTTGAGCGTC 
GAACTCGCAG 
AAACGCCAGC 
TTTGCGGTCG 
TTGCTCACAT 
AACGAGTGTA 
ATTACCGCCT 
TAATGGCGGA 
GCGCAGCGAG 
CGCGTCGCTC 
CGCCTCTCCC 
GCGGAGAGGG 
TTTCCCGACT 
AAAGGGCTGA 
GCTCACTCAT 
CGAGTGAGTA 
TGTTGTGTGG 
ACAACACACC 
GACCATGATT 
CTGGTACTAA 



CTGGTATCTT 
GACCATAGAA 
GATTTTTGTG 
CTAAAAACAC 
AACGCGGCCT 
TTGCGCCGGA 
GTTCTTTCCT 
CAAGAAAGGA 
TTGAGTGAGC 
AACTCACTCG 
TCAGTGAGCG 
AGTCACTCGC 
CGCGCGTTGG 
GCGCGCAACC 
GGAAAGCGGG 
CCTTTCGCCC 
TAGGCACCCC 
ATCCGTGGGG 
AATTGTGAGC 
TTAACACTCG 
ACGAATTGAA 
TGCTTAACTT 



I 



47/53 



WO 2005/026370 



PCT/US2004/028751 



FIGURE 6A 
NY-ESO-1 

Met Gin Ala Glu Gly Arg Gly Thr Gly Gly Ser Thr Gly Asp Ala Asp 
Gly Pro Gly Gly Pro Gly He Pro Asp Gly Pro Gly Gly Asn Ala Gly 
Gly Pro Gly Glu Ala Gly Ala Thr Gly Gly Arg Gly Pro Arg Gly Ala 
Gly Ala Ala Arg Ala Ser Gly Pro Gly Gly Gly Ala Pro Arg Gly Pro 
His Gly Gly Ala Ala Ser Gly Leu Asn Gly Cys Cys Arg Cys Gly Ala 
Arg Gly Pro Glu Ser Arg Leu Leu Glu Phe Tyr Leu Ala Met Pro Phe 
Ala Thr Pro Met Glu Ala Glu Leu Ala Arg Arg Ser Leu Ala Gin Asp 
Ala Pro Pro Leu Pro Val Pro Gly Val. Leu Leu Lys Glu Phe Thr Val 
Ser Gly Asn He Leu Thr He Arg Leu Thr Ala Ala Asp His Arg Gin 
Leu Glu Leu Ser He Ser Ser Cys Leu Gin Gin Leu Ser Leu Leu' Met 
Trp He Thr Gin cys Phe Leu Pro Val Phe Leu Ala Gin Pro Pro Ser 
Gly Gin Arg Arg 



48/53 



WO 2005/026370 



PCT/US2004/028751 





Met 


Ser 


Pro Leu Trp 




Leu 


Pro Gly Ala Gin 


5 


Leu 


Val 


Asn Lys Glu 




Cys 


Gly Ser Gin Gin 




Arg 


Pro Trp Ser Gly 




Trp 


Pro Arg Lys Phe 




Gly 


Tyr Asn Cys Gly 


10 


Arg 


Lys 


Lys Pro Pro 




Glu 


Arg 


Glu Gin Phe 




Pro 


Asp Tyr Val He 




Gly 


Thr 


Gin Pro Gin 




Leu 


His 


Tyr Tyr Ser 


15 


Arg 


Ala He Asp Phe 




Tyr 


His 


Leu Leu Cys 




Ser 


Phe 


Ala Leu Pro 




Val 


Cys 


Thr Asp Gin 




He 


Ser Arg Asn Ser 


20 


Asp 


Asp Tyr Asn His 




Leu 


Arg Arg Asn Gin 




Asp 


He 


Arg Asp Cys 




Gin 


Asn 


Ser Thr Phe 




Asp 


Gly 


Thr Leu Asp 


25 


Phe 


Leu 


Asn Gly Thr 




Phe 


Val 


Val Leu His 




Arg 


Phe 


Asn Pro Pro 




His 


Asn 


Arg Met Tyr 




Glu 


Leu 


Phe Leu Thr 


30 


Pro 


Val 


Ser Val Glu 




Met 


Gly 


Thr Leu Val 




Gin 


Tyr 


Arg Arg Leu 




Ser 


Ser 


Lys Arg Tyr 



FIGURE 6C 
TRP-2 



Trp Gly 


Phe 


Leu 


Leu Ser 


Gly Gin 


Phe 


Pro 


Arg Val 


Cys Cys 


Pro 


Arg 


Leu Gly 


Gly Arg 


Gly 


Gin 


Cys Thr 


Pro Tyr 


He 


Leu 


Arg Asn 


Phe His 


Arg 


Thr 


Cys Lys 


Asp Cys 


Lys 


Phe 


Gly Trp 


Val He 


Arg 


Gin 


Asn He 


Leu Gly 


Ala 


Leu 


Asp Leu 


Thr Thr 


Gin 


His 


Trp Leu 


Phe Ala 


Asn 


Cys 


Ser Val 


Val Arg 


Asp 


Thr 


Leu Leu 


Ser His 


Gin 


Gly 


Pro Ala 


Leu Glu 


Arg 


Asp 


Leu Gin 


Tyr Trp 


Asn 


Phe 


Ala Thr 


Leu Phe 


Gly 


Ala 


Ala Arg 


Arg Phe 


Ser 


Ser 


Trp Glu 


Leu Val 


Thr 


Leu 


Cys Asn 


Met Gly 


Arg 


Asn 


Ser Met 


Leu Ser 


Leu 


Gin 


Lys Phe 


Ser Phe 


Arg 


Asn 


Ala Leu 


Ser Gin 


Val 


Met 


Ser Leu 


Asn Ala 


Leu 


Pro 


His Ser 


Ser Phe 


Thr 


Asp 


Ala He 


Ala Asp 


Ala 


Trp 


Pro Gin 


Asn Met 


Val 


Pro 


Phe Phe 


Ser Asp 


Gin 


Leu 


Gly Tyr 


Glu Thr 


Pro 


Gly 


Trp Pro 


Ala Leu 


Val 


Gly 


Leu Phe 


Arg- Lys 


Gly 


Tyr 


Thr Pro 


Thr Glu 


Glu 


Ala 





Cys Leu Gly Cys Lys He 
Cys Met Thr Val Asp- Ser 
Ala Glu Ser Ala. Asn Val 
Glu Val Arg Ala Asp Thr 
Gin Asp Asp Arg Glu Leu 
Cys Thr Gly Asn Phe Ala 
Thr Gly Pro Asn Cys Glu 
His Ser Leu Ser Pro Gin 
Ala Lys Lys Arg Val His 
Gly Leu Leu Gly Pro Asn 
Tyr Asp Phe Phe Val Trp 
Gly Pro Gly Arg Pro Tyr 
Phe Val Thr Trp His Arg 
Arg Leu He -Gly Asn Glu 
Gly Arg Asn Glu Cys Asp 
Pro Asp Asp Pro Thr Leu 
Thr Val Cys Asp Ser Leu 
Gly Thr Tyr Glu Gly Leu 
Lys Leu Pro Thr Leu Lys 
Asp Asn Pro Pro Phe Phe 
Glu Gly Phe Asp Lys Ala . 
His Asn Leu Val His Ser 
Ala Ala Asn Asp Pro He 
Phe Asp Glu Trp Met Lys 
Glu Leu Ala Pro He Gly 
Pro Pro Val Thr Asn -Glu 
Ser Tyr Ala He Asp Leu 
Thr Thr Leu Leu Val Val 
Val Leu Leu Ala Phe Leu 
Leu Met Glu Thr His Leu 
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FIGURE 6D 
gplOO and gplOOM 



-1 MDL VLKRCLLHLA VIGALLAVGA TKVPRNQDWL -GVSRQLRTKA WNRQLYPEWT 

2 *** ********** ********** ********** ********** ********** 

1 EAQRLDCWRG GQVSLKVSND GPTLIGANAS FSIALNFPGS QKVLPDGQVI WVNNTIINGS 

2 ********** ********** ******************** ********** ********** 

1 QVWGGQPVYP QETDDACIFP DGGPCPSGSW SQKRSFVYVW KTWGQYWQFL GGPVSGLSIG 

2 ********** ********** ********** .********** ********v* ********** 

i 

1 TGRAMLGTHT MEVTVYHRRG SRSYVPLAHS SSAFTITDQV PFSVSVSQLR ALDGGNKHFL 

2 ********** ********** ********** **'****m*** ********** ********** 

1 RNQPLTFALQ LHDPSGYLAE ADLSYTWDFG ' DSSGTLISRA LWTHTYLEP GPVTAQWLQ 

2 ********** ********** ********** ********** ********** ****v***** 

1 AAIPLTSCGS SPVPGTTDGH RPTAEAPNTT AGQVPTTEW GTTPGQAPTA EPSGTTSVQV 

2 ********** ********** ********** ********** ********** ********** 

1 PTTEVISTAP VQMPTAESTG MTPEKVPVSE VMGTTLAEMS TPEATGMTPA EVSIWLSGT 

2 ********** ********** ********** ********** ********** *********** 

1 TAAQVTTTEW VETTARELPI PEPEGPDASS IMSTESlTGS LGPLLDGTAT LRLVKRQVPL 

2 ********** ********** ********** ********** ********** ********** 

1 DCVLYRYGSF SVTLDIVQGI ESAEILQAVP SGEGDAFELT VSCQGGLPKE ACMEISSPGC 

2 ********** ********** ********** ********** ********** ********** 

1 QPPAQRLCQP VLPSPACQLV LHQILKGGSG TYCLNVSLAD TNSLAWSTQ LIMPGQEAGL 

2 ********** ********** ********** ********** ********** ********** 

. 1 GQVPLIVGIL LVLMAWLAS LIYRRRLMKQ DFSVPQLPHS SSHWLRLPRI FCSCPIGENS 
2 ********** ********** ********** ********** ********** ********** 

1 PLLSGQQV2 ******** 
Key 

*=ddentical amino acid residue 

l=gpl00 

2=gpl00M 
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FIGURE 6E 
MART-1 

Met Pro Arg Glu Asp Ala His Phe lie 
Lys Lys Gly His Gly His Ser Tyr Thr 
Ala Ala Gly lie Gly He Leu Thr Val 
Leu Leu Leu He Gly Cys Trp Tyr Cys 
Gly Tyr Arg Ala Leu Met Asp Lys Ser 
Thr Gin Cys Ala Leu Thr Arg Arg Cys 
Phe Asp His Arg Asp Ser Lys Val Ser 
Asn Cys Glu Pro Val Val Pro Asn Ala 
Glu Lys Leu Ser Ala Glu Gin Ser Pro 
Pro 



Tyr Gly Tyr Pro 
Thr Ala. Glu. Glu 
He Leu Gly Val 
Arg Arg Arg Asn 
Leu His Val Gly 
Pro Gin *<3lu Gly 
Leu Gin Glu Lys 
Pro Pro Ala Tyr 
Pro Pro Tyr * Ser 
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FIGURE 6F 
MAGE-1 



Met 


Ser 


Asp 


Asn Lys Lys Pro Asp 


Lys Ala His Ser. 


Gly Ser 


Gly 


Gly 


Asp 


Gly 


ASp 


Gly Asn Arg Cys Asn 


Leu Leu His Arg 


Tyr Ser 


Leu 


Glu 


Glu 


lie 


Leu 


Pro Tyr Lieu Gly Trp 


Leu Val Phe Ala 


Val Val 


Thr 


Thr 


Ser 


Phe 


Leu 


Ala Leu Gin Met Phe 


He Asp Ala Leu 


Tyr Glu 


Glu 


Gin 


Tyr 


Glu 


Arg 


Asp Val Ala Trp lie 


Ala Arg Gin Ser 


Lys Arg 


Met 


Ser 


Ser 


Val 


Asp 


Glu Asp Glu Asp Asp 


Glu Asp Asp Glu 


Asp Asp 


Tyr 


Tyr 


Asp 


Asp 


Glu 


Asp Asp Asp Asp Asp 


Ala Phe Tyr Asp 


Asp Glu 


Asp 


Asp 


Glu 


Glu 


Glu 


Glu Leu Glu Asn Leu 


Met Asp Asp Glu 


Ser Glu 


Asp 


Glu 


Ala 


Glu 


Glu 


Glu Met Ser Val Glu 


Met Gly Ala Gly 


Ala Glu 


Glu 


Met 


Gly 


Ala 


Gly 


Ala Asn Cys Ala Cys 


Val Pro Gly His 


His Leu 


Arg' 


Lys 


•As 11 


Glu 


Val 


Lys Cys Arg Met lie 


Tyr Phe" Phe His 


Asp Pro 


Asn 


Phe 


Leu 


Val 


Ser 


lie Pro Val Asn Pro 


Lys Glu Gin Met 


Glu Cys 


Arg 


Cys 


Glu 


Asn 


Ala 


Asp Glu Glu val Ala 


Met Glu Glu Glu 


Glu Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu Glu Glu Glu Met 


Gly Asn Pro Asp 


Gly Phe 


Ser 


Pro 



FIGURE 6G 
MAGE-3 

mpleqrsqhc kpeeglearg ealglvgaqa pateeqeaas 
ppqspqgass lpttmnyplw sqsyedssnq eeegpstfpd 
llkyrarepv tkaemlgsw gnwqyffpvi fskassslql 
clglsydgll gdnqimpkag lliivlaiia regdcape'ek 
dpkkll'tqhf. vqenyleyrq vpgsdpacye flwgpralve 
ypplhewvlr egee 



ssstlvevtl gevpaaespd 
lesefqaals rkvaelvhfl 
vfgielmevd pighlyifat 
iweelsvlev fegredsilg" 
tsyvkvlhhm vkisggphis 
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FIGURE 6H 
B7.1 

mghtrrqgts pskcpylnff qllvlaglsh fcsgvihvtk evkevatlsc ghhvsveela 
qtriywqkek kmvl trams gd mniwpeyknr tifditnnls ivilalrpsd egtyecwlk 
yekdafkreh laevtlsvka dfptpsisdf eiptsnirri icstsggfpe phlswlenge 
elnainttvs qdpetelyav sskldfnmtt nhsfmcliky ghlrvnqtfn wnttkqehfp 
dnllpswait lisvngifvi ccltycfapr crerrrnerl rresvrpv 

FIGURE 61 
LFA-3 

mvagsdagra lgvlswcll hcfgfiscfs qqiygwygn vtfhvpsnvp lkevlwkkqk 
dkvaelense frafssfknr vyldtvsgsl tiynltssde deyemespni tdtmkfflyv 
leslpsptlt caltngsiev qcmipehyns hrglimyswd cpmeqckrns tsiyfkmend 
lpqkiqctls nplfnttssi ilttcipssg hsrhryalip iplavittci vlymngilkc 
drkpdrtnsn 

FIGURE 6J 
ICAM-1* 

mapssprpal pallvllgal fpgpgnaqts vspskvilpr ggsvlvtcst scdqpkllgi 
etplpkkell lpgnnrkvye lsnvqedsqp mcysncpdgq staktfltvy wtpervelap 
lpswqpvgkn itlrcqvegg apranltwl lrgekelkre pavgepaevt ttvlvrrdhh 
ganfscrtel dlrpqglelf ehtsapyqlq tfvlpatppq lvsprvlevd tqgtvvcsld 
glfpvseaqv hlalgdqrln ptvtygndsf sakasvsvta edegtqrltc . ayilgnqsqe 
tlqtytiysf papnviltkp evsegtevtv kceahprakv tlngvpaqpl gpraqlllka 
tpedngrsfs csatlevagq lihknqtrel rvlygprlde rdcpgnwtwp ensqqtpmcq 
awgnplpelk clkdgtfplp igesvtvtrd legtylcrar stqgevtrev tvnvlsprye 
iviitvvaaa vimgtaglst ylynrqrkik kyrlqqaqkg tpmkpntqat pp 

*mature sequence begins at residue 28 (q) 
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